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CONTEXT DEPENDENT INHIBITORS OF CYTIDINE DEAMINASES AND USES 

THEREOF 

This invention was made with government support under Grants DK43739 and 
5 AI54369 and RR15934 awarded by the National Institutes of Health, and a grant from the 
Air Force Office for Scientific Research. Therefore, the government may have certain rights 
in the invention. 

CROSS REFERENCE TO RELATED APPLICATIONS 

10 This application claims priority to U.S. Provisional Application 60/586,490, filed 

May 6, 2004. 

BACKGROUND OF THE INVENTION 

Cytidine deaminases (CD As) represent a novel class of enzymes involved in 

15 pyrimidine metabolism in both lower and higher organisms. The fundamental cytidine 

deaminase reaction requires a cytidine deaminase with bound Zn 2+ , which serves as a Lewis 
acid in a reaction that activates a water. The activated water serves as a nucleophile that 
attacks the C4 (sp 2 ) position of the cytosine ring leading to a transition state that proceeds 
through a tetrahedral intermediate (sp 3 ). A subsequent change from the enol to keto base 

20 tautamer results in an aprotic exocylic oxygen at C4, accompanied by the elimination of 
ammonia. The product is the uridine nucleoside. The reverse reaction is unfavorable and 
does not occur to any measurable extent on the enzyme. 

The synthesis and inhibitory activity of cytidine deaminase inhibitors have been 
described in the art (Kim, C.H., et al., J Med Chem, 29(8): 1374-80, 1996; Laliberte, J. 

25 Cancer Chemother Pharmacol, 30(1): 7-11, 1992; Driscoll, J.S., et al., J Med Chem, 

34(ll):3280-4, 1991; Xiang, S., et al., Biochemistry, 36(16):4768-74, 1997; Xiang, S., et 
al., Biochemistry, 35(5):1335-41, 1996; Xiang, S., et al., Biochemistry, 34(14):4516-23, 
1995; Frick, L., et al., Biochemistry, 28(24):9423-30, 1989.) 

Historically the development of the zebularine nucleoside was based on the need for 

30 an inhibitory agent to impair the function of cytidine deaminases that metabolize 
antileukemic nucleosides such as 5-aza-2 , -deoxycytidine (5-aza-dC) and cytidine 
arabinoside (ara-C) (Foubister, V., Drug Discov Today 8(10):430-1, 2003.) Such 
undesirable cytidine deaminase activity renders these therapeutics ineffective as anti- 

1 
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leukemia drugs. By comparison to 5-aza-dC and ara-C, zebularine is considerably less 
cytotoxic and more chemically stable, up to pH 12, in aqueous solution (Barchi, J.L, Jr., J. 
Organic Chem., 57:536-541, 1992; Kelley, J.A., et al., J Med Chem, 29(ll):2351-8, 1986). 
Interest in zebularine as a therapeutic has emerged due to its efficacy as an anti-DNA 
5 methyltransferase drug Cheng, J.C., et al., J Natl Cancer Inst, 95(5):399-409, 2003; Zhou, 
L., et al., J Mol Biol, 321(4):591-9, 2002). In many types of cancers, tumor suppressor 
genes become inactivated due to abnormal methylation of their promoter regions. Clinical 
studies have demonstrated that oral administration of zebularine can reactivate genes that 
naturally suppress cancer through a mechanism that inhibits methylation. However, in 

1 0 animal studies, the efficacy of zebularine in free nucleoside form was limited due to the 
very high oral doses needed (lg per kg body weight in mouse models Foubister, V., Drug 
Discov Today, 8(10):430-1, 2003). Furthermore, zebularine is cytotoxic due to its non- 
specific action against the enzymes of pyrimidine metabolism and indiscriminate targeting 
of DNA methyltransferases. What is needed in the art are inhibitors that selectively target 

1 5 cytidine deaminases with the property that inhibition has little or no cytotoxicity. 

In this regard, cytidine deaminases active on cytidine or deoxycytidine in RNA or 
DNA (respectively) and that belong to a family of enzymes known as APOBEC-1 Related 
Proteins (ARPs) have been shown to play a role in modifying nucleic acid sequences and 
thereby giving rise to altered protein expression and/or serving as antiviral agents. ARPs 

20 modify select cytidines or deoxycytidines in RNA or DNA through targeting sequence 
specific signals within 5' and/or 3 1 sequences flanking the modified base. The flanking 
sequences represent a unique context within which the target cytidine of deoxycytidine is 
embedded. 

SUMMARY 

25 The invention discloses context dependent inhibitors of cytidine deaminases, 

(Cytidine deaminase inhibitors) and methods for their use. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and constitute a part of this 
30 specification, illustrate several embodiments of the invention and together with the 
description, serve to explain the principles of the invention. 

Figure 1 shows a summary of the relevant chemical mechanism and stereochemical 
aspects of the fundamental cytidine deaminase reaction. The noteworthy aspect of the 
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reaction is that it requires a cytidine deaminase with bound Zn , which serves as a Lewis 
acid in the reaction to activate water. Activated water serves as a nucleophile that attacks 
the C4 (sp 2 ) position of the cytosine ring leading to a transition state that proceeds through a 
tetrahedral intermediate (sp 3 ), for which the enzyme exhibits its greatest affinity. A change 
5 from enol to keto base tautamer results in a keto oxygen at C4, accompanied by the 

elimination of ammonia. The product is the uridine nucleoside. The reverse reaction is 
unfavorable and does not occur to any measurable extent on the enzyme. 

Figure 2 shows the stereochemical property of zebularine that makes it such an 
effective inhibitor of cytidine deaminase activity. The enzyme modifies the analog base in a 

10 mechanism-based manner. Specifically, when the zebularine nucleoside is bound by the 
enzyme active site, the de-oxo C4 position undergoes nucleophilic attack in a manner 
analogous to that observed for the cytidine substrate seen in Figure 1. However, due to the 
absence of a suitable leaving group (i.e. hydrogen in lieu of ammonia), the enzyme remains 
trapped in complex with the C4-hydroxylated analog, which mimics the tetrahedral 

1 5 geometry of the transition state. Hence, the enzyme exhibits much greater affinity for the 
3,4 dihydrouridine adduct than the zebularine ground- state. Therefore, an inhitor need not 
be one that is chemically susceptible to nucleophilic attack, but can be one that mimics the 
chemical properties of the intermediate in geometry and charge. 

Figure 3 shows how representative substrate specificity is achieved for the cytidine 

20 deaminase APOBEC-1 whose target site interaction is conferred through direct interaction 
with an auxiliary factor, ACF. The latter complementation factor binds APOBEC-1, as well 
as an RNA sequence that flanks the deaminated site (C6666) known as the * mooring 
sequence. RNA recognition by ACF requires 3 tandem RNA recognition motifs (RRMs) 

that interact with high affinity (K^ = ~ 8 nM) on the mooring sequence. Due to their 

25 sequence homology to APOBEC-1, APOBEC related proteins (ARPs) can require similar 
high-specificity auxiliary factors in substrate targeting. One analysis of the ARP known as 
the activation induced deaminase (AID) showed that deoxycytidine is targeted for 
modification within immunoglobulin genes in the context of single stranded DNA sequence 
(A/G-G-C-A/T (RGYW) or A/T-A/G-C-C/T (WRCY) within Ig genes for SHM, CSR and 

30 on reporter DNAs. However, many other sites were modified as well (Harris, R. S., 

Petersen-Mahrt, S. K. and Neuberger, M. S. (2002). Mol Cell 10, 1247-53; Petersen-Mahrt, 
S. K., et al., (2002). Nature 418, 99-104; Yu, K., et al., (2003). J. Biol Chem. 279, 6496- 
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6500. Targeting of deoxycytidines within the HIV-1 single stranded minus strand 
intermediates during viral reverse transcription demonstrated a target sequence preference 
of ACCC or GCCC in which the third C is deaminated (Zhang et al, (2003) Nature 424,94- 
8; Lecossier et al., Science 300, 1 1 12). 
5 In the editosome deamination model, APOBEC-1 is the core editing enzyme of a 

macromolecular complex required for site specific C to U deamination of the mRNA 
encoding the serum lipoprotein apoB. By itself APOBEC-1 exhibits little RNA binding 
specificity; however, through interactions with APOBEC-1 complementation factor, ACF, 
the editing enzyme is guided to the correct target site to perform its natural biochemical C to 
10 U deamination function because ACF binds selective to the mooring sequence. This 

paradigm of auxiliary proteins and their selective interaction with target sequences is a 
theme with the ARP family described throughout.. Figure 4 shows several other context 
dependent inhibitors (Cytidine deaminase inhibitors) of ARP-related cytidine deaminases 
other than zebularine. 

15 Figure 5 shows a description of the sequence environment employed for the 

synthesis and application of cytidine deaminase inhibitors containing embedded zebularine. 
The single stranded construct is envisioned for APOBEC-1, which entails incorporating 
specific sequences of the tripartite apoB mRNA substrate; Z marks the position of 
zebularine. A single-stranded or double-stranded "bubble" target provides a substrate mimic 

20 for AID. 

Figure 6 shows that the expression of CEM15 in 293T cells resulted in at least a 
100- fold decrease in Vif- viral infectivity compared to particles generated in parental 293T 
cells. The low level of GFP expression from vif(-), CEM15+ particles is indistinguishable 
from background fluorescence in control cells [0.2%]. This assay is amenable to the use of 

25 existing HIV-1 proviral isotyped vectors that are deleted for different regions and different 
amounts of the HIV-1 genome. Deleted genes can be provided in trans by co-transfection of 
suitable expression plasmids. 

Figure 7 shows expression of AID transcripts in human B cell lymphoma lines. 
Total RNA was extracted from the cell lines indicated, reverse transcribed and subjected to 

30 PCR amplifications with primers to AID and p-actin as described (McCarthy et al. (Blood 
(94) Feb. 13, 2003). Amplified fragments were separated on 1.2% agarose gels. AID PCR 
products were then transferred onto Hybond nylon filters, hybridized to a 32 P-labeled 
internal AID oligonucleotide, and the hybridization signal was detected using 
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phosphorimager screens. Note the different AID band patterns present in the individual 
lymphoma lines. (wt, wild-type-size AID band (646bp); svl/sv2, bands compatible with 
size of known splice variants; *, potential novel splice variants.) 

Figure 8 shows retroviral expression of AID. His-tagged murine AID cDNA was 
5 cloned into the pMIG retroviral vector (Pear et al. Blood 92:378-3792, 1998) and retroviral 
particles were generated using the Phoenix-Eco packaging cell line. Viral supenatants were 
used to transduce in vitro activated B cell. Panel A: Dayl LPS-activated lymphoblasts were 
incubated with Pheonix-Eco virus-free supernatant (No virus), pMIG virus containing 
supernatants and pMIG-AID containing supernatants, in the presence of 4 jag/ml polybrene. 

10 At day 5 of culture, cells were harvested and analyzed by flow cytometry (FSC/SSC live- 
gated cells shown in plots). Transduction efficiencies average 30-40% with the pMIG virus, 
and 20-30% with pMIG-AID. Note that GFP expression is lower for pMIG-AID than for 
pMIG, probably because the AID in the first cistron position diminished expression of the 
IRES-GFP module (therefore, transduction efficiencies with pMIG-AID may be somewhat 

15 underestimated). Panel B: Day 5 cells from oCD40- (one experiment) and LPS-stimulated 
cultures transduced with either pMIG (open bars) or pMIG-were stained with PE- 
conjugated anti-IgG antibodies (algGl for CD40 culture, algG2b/aIgG3 for the LPS 
cultures), analyzed by flow cytometry, and rates of IgG switching in transduced, GFP+ cells 
compared. Note how pMIG-AID transduction increases IgG switching rates by 2-4 folds, 

20 indicating expression of functional AID. 

Figure 9 shows retroviral transduction of human B lymphoma cells. Cultured cells 
from the indicated lymphoma lines were transduced with an RD114-packaged GFP- 
expressing retrovirus using the flow-through method (Chuck et al. Hum Gene Ther 7:743- 
750, 1996). 48 hours after transduction, cells were analyzed by flow cytometry for GFP 

25 expression. Efficiencies vary between 20% and >80%. 

Figure 10 Ribbon representations of /ZsAPOBEC-1 and HsATD comparative 
models. (A) The dimeric APOBEC-1 model with polypeptide chains colored: purple and 
red (NTCD and NCCTD); and blue and green (NTCD and NCCTD). A central flap (cyan 
with hatched oval) connects the NTCD to the NCCTD. Each NTCD coordinates Zn 2+ (dark 

30 green sphere). Trans-acting structure elements that form the purple active site are: Tl and 
L2 (green NCCTD), flap (cyan with hatched circle) and T3 (blue NTCD). The symmetry 
axis in black represents a proper dyad; the blue axes represent improper (pseudo) 2-fold 
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rotations. (B) The dimeric AID structure as in 2B. The subunit interface of AID obeys D 2 
symmetry analogous to APOBEC-1, but axes were omitted for clarity. 

Figure 11 Stereo views of the Hs APOBEC-1 and HsAJD active sites with bound 
RNA and DNA substrates depicted as ball-and-stick models (yellow). (A) The APOBEC-1 
5 active site with bound apoB mRNA substrate 5'-GAUAU 6 666AA-3\ (B) The APOBEC-1 
active site with bound DNA substrate 5'-d(ATCTC*CG)-3\ described as an rpoB mutation 
hot spot (13). (C) The AID active site with bound DNA substrate 5'-d(TAAGU*TA)-3\ 
described as an SHM hot spot (14). Residues mutated in HIGM2 syndrome are colored red. 
For all diagrams, the protein Cot backbone is drawn as a ribbon showing contributions of 

10 both polypeptides chains of the dimer (i.e. magnified views of Fig. 10A and 10B). Basic 
residues (light blue), acidic residues (pale pink), and aromatic residues (gray) are drawn as 
stick representations. Predicted hydrogen bonds and ionic interactions with Zn are 
depicted as black lines. The sites of base deamination are indicated by red arrowheads. For 
clarity, not all amino acids are shown. 

15 Figure 12 shows the product eluted in the synthesis of a context dependent inhibitor 

of DNA harboring 5-methyl-2'-deoxyzebularine. A sequence was synthesized was a 15- 
mer comprising: 5'-d(AGC-TAG-(dmZ)-TAA-GTT-AT)-3' (SEQ ID NO: 22), where one 
of the edited positions has been replaced by 5-methyl-2'-deoxyzebularine denoted (dmZ) 
(Example 1). The product eluted as a single peak with a retention time of 44.3 min; failure 

20 sequences and impurities were clearly separated in this step. The DNA was detected at 260 
nm and the pooled material was lyophilized to dryness. The yield was 25% (nearly 2 mg). 

Figure 13 shows MALDI/TOF mass spectrometry for 5-methyl-2'-deoxyzebularine. 
The major peak was observed at an m/z of 4532. This result demonstrates the existence of 
an oligomer of the correct molecular weight that represents the largest component of the 

25 sample. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The invention provides compounds that inhibit cytidine deaminase, as well as 
methods of using, identifying, and making such compounds. The compounds, cytidine 
deaminase inhibitors (also referred to as Cytidine deaminase inhibitors), are useful in 
30 preventing or treating a variety of diseases as well as aiding in combination with other 
treatment therapies. Described herein are cytidine deaminase inhibitors of cytidine 
deaminases, including APOBEC-1 and APOBEC-1 related proteins (ARPs). 
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Cytidine deaminase inhibitors comprise a polymeric substrate having a targeting 
function and an inhibiting moiety. The targeting polymeric substrate can be directed to a 
cytidine deaminase or a cytidine deaminase auxiliary protein. The inhibiting moiety can be 
a nucleoside. The nucleic acid sequence of the polymeric stubstrate can contain a sequence 
5 that binds the cytidine deaminase. For example, the sequence can be a RNA or DNA and 
can flank the zebularine nucleotide 5 ! , 3* or both 5' and 3 1 (as is the case with the tripartite 
motif containg the mooring sequence within apoB mRNAi.e., the mooring sequence of 
APOBEC-1, is located 3' of the edited site at C6666, a spacer sequence lies immediately 3 ! 
of C6666 and an enhancer element lies immediately 5' of C6666). Alternately, the 

10 polymeric substrate binds the auxiliary protein The cytidine deaminase inhibitor can inhibit 
a cytidine deaminase or an auxiliary protein, by, for example, complexing with the cytidine 
deaminase or the auxiliary protein in a manner that blocks the cytidine deaminase function. 
The cytidine deaminase inhibitors can also chemically or structurally alter the cytidine 
deaminase or auxiliary protein so as to inhibit its ability to deaminate. 

1 5 Optimally, the chemical property of cytidine deaminase inhibitors that causes them 

to be effective inhibitors of cytidine deaminase activity is that the enzyme modifies the 
analog base in a mechanism-based manner. For example, when a zebularine nucleoside is 
bound by the enzyme active site, the de-oxo C4 position undergoes nucleophilic attack (Fig. 
2) in a manner analogous to that observed for the cytidine substrate (Fig. 1). However, due 

20 to the absence of a suitable leaving group (such as ammonia), the enzyme remains trapped 
in complex with the hydroxylated analog, which mimics the tetrahedral geometry 
characteristic of the transition state. Hence, the enzyme exhibits much greater affinity for 
the 3,4 dihydrouridine adduct than the zebularine ground-state. However, the nature of the 
cytidine deaminase inhibitor need not be one that requires strict chemical reactivity (defined 

25 as geometric transformation). The cytidine deaminase inhibitor functions by mimicking the 
tetrahedral intermediate preferred by the enzyme as representative of the charge and 
geometry observed in the transition state. For example, 3,4,5,6-tetrahydrouridine embedded 
within the substrate can act in an inhibitory manner. There are several examples of cellular 
and viral mRNA editing reactions in mammalian cells. (Grosjean and Benne (1998) 

30 Modification and Editing of RNA, ASM Press, Washington D.C.; Smith et al. (1997) RNA 
3: 1 105-23). Two examples of such editing mechanisms are the adenosine to inosine and 
cytidine to uridine conversions. (Grosjean and Benne, 1998; Smith et al. Trends in Genetics 
12:418-24, 1996; Krough et al. J. Mol. Biol. 235:1501-31, 1994). The latter enzymes can 
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be expected to be inhibited by cytidine deaminase inhibitors with the form of nebularine 
(which is the purine version of zebularine). Editing can also occur on both RNA and on 
DNA, and typically these functions are performed by different types of deaminases. 

Cytidine deaminases are responsible for cytidine to uridine conversions. This group 
5 of enzymes also includes deoxycytidine deaminases, which are responsible for 

deoxycytidine to uridine conversions. The term "cytidine deaminase" used throughout the 
application is interchangeable with "deoxycytidine deaminase." "Cytidine deaminases" also 
include those deaminases active on both DNA and RNA. As described in the foregoing 
section, this simplifcation stems from the fact that APOBEC-1 and ARPs can act on single 

10 stranded RNA and/or single stranded DNA. 

The most highly studied Cytidine Deaminase Active on RNA (CDAR), which acts 
upon polymeric nucleic acid substrates, is the apoB editing catalytic chain- 1, APOBEC-1 
(Wedekind, J.E., et al., Trends Genet, 19(4):207-16, 2003). The selectivity requirements of 
APOBEC-1 in editing of apoB RNA reporters was similar to that of the yeast enzyme 

15 CDD1 when the respective proteins were overexpressed in yeast (Dance et al., (2000) 
Nucleic Acids Res. 28, 424-49; Dance et al., (2001) Nucleic Acids Res. 29, 1772-80). 
Hence, the structure of CDD 1 provided a tenable connection between known CD A and 
yeast cytosine deaminase (ScCD) enzymes, and the family of mammalian APOBEC related 
proteins. Consequently, four empirically derived deaminase structures were employed in 

20 comparative modeling of Hs APOBEC-1 and HsAYD. Preparation of a structural template 
revealed major differences in the modes of substrate binding by CDA and ScCD enzymes. 
Comparative modeling of APOBEC-1 and AID suggested that the domain interface 
between subunits influences the positioning of ribose binding loops as in CD As and that a 
variably-sized flap at the catalytic site regulates access based on substrate size, which was 

25 corroborated by functional data. Both AID and APOBEC-1 targeted DNA in vivo (Harris, 
R. S. (2002) Mol. Cell 10, 1247-53; Petersen-Mahrt, S. K., (2002) Nature 418, 99-103.). 
However, APOBEC-1 did not substitute for AID in CSR or SHM (Eto, T., (2003) Proc. 
Natl. Acad. Sci. U.S.A. 100, 12895-8) and AID could not substitute for APOBEC-1 in apoB 
mRNA editing (Muramatsu, M. (1999) J. Biol. Chem. 274, 18470-6) showing each enzyme 

30 has an inherent specificity for its own substrate. Specifically, models of APOBEC-1 and 
AID (Figures 10 and 11) indicated either DNA or RNA substrates could be accommodated 
by their active sites, but only in single-stranded form. 
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APOBEC-1 is the core editing enzyme of a macromolecular complex or editosome 
required for site specific C to U deamination of the mRNA encoding the serum lipoprotein 
apoB. By itself APOBEC-1 exhibits little RNA binding specificity; however, in the 
presence of the APOBEC-1 complementation factor (ACF) the editing enzyme is guided to 
5 the correct target site to perform its natural biochemical C to U deamination function 
(Mehta, A., et al., Mol Cell Biol 20, 1846-54, 2000). APOBEC-1 target specificity is 
conferred through direct interaction with ACF, which mediates contact with a flanking 
RNA 'mooring sequence' located 3' of the edited site at C6666 (Fig. 3). RNA recognition 
by ACF requires 3 tandem RNA recognition motifs (RRMs) that interact with high affinity 
10 (Ka = - 8 nM) on the mooring sequence (Fig. 3) (Smith, H.C., Cell Biol, 4:267-78, 1993; 
Mehta, A. and D.M. Driscoll, RNA, 8(l):69-82, 2002; Blanc, V., et al., J Biol Chem, 276: 
46386-93, 2000.) 

Due to the tripartite substrate sequence motif of apoB mRNA, which includes (i) a 
mooring sequence, (ii) an "enhancer" sequence and (iii) a "spacer" sequence, which 
15 collectively flank the C6666 editing site in apoB mRNA, the site appears unique compared 
to all other mammalian mRNA, rRNA and tRNA sequences. Hence the intermolecular 
interactions between APOBEC-1, ACF, and the RNA substrate can be selectively targeted 
by incorporating the unique tripartite sequence features into the cytidine deaminase 
inhibitor. 

20 As a therapeutic, an understanding of APOBEC-1 is relevant to heart disease and 

stroke. Current lipid-lowering therapies, such as statins, have shown effective potency as 
inhibitors of hydroxymethylglutaryl (HMG)-CoA reductase, which catalyzes the committed 
step in the synthesis of cholesterol. The other lipid-lowering bile-acid-binding resin therapy 
has shown efficacy via sequestering the bile acids in the intestine, thereby interrupting the 

25 enterohepatic circulation of bile acids and increasing the elimination of cholesterol from the 
body. These are valid therapies for patients with hyperlipidemia. However, up to 30% of 
the patients have been observed to indicate adverse reactions to these therapies, showing 
that alternative therapies have a market. Moreover the latter therapies are less effective on 
patient suffering from metabolic syndrome. 

30 Cholesterol is carried in the blood from one tissue to another as lipoprotein particles 

by specific carrier proteins called apolipoproteins. Apolipoprotein B (apoB) is an integral 
and non-exchangeable structural component of lipoprotein particles referred to as 
chylomicrons, very-low-density lipoprotein (VLDL) and low-density lipoprotein (LDL). 
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ApoB circulates in human plasma as two isoforms, ApoBlOO and apoB48. ApoBlOO can 
be converted into ApoB48 by the enzyme APOBEC-1 (ApoB mRNA editing catalytic 
subunit-1). With the help of auxiliary factors, APOBEC-1 can stop the synthesis 
(production) of ApoBlOO form and start the synthesis of ApoB48 form in cells. ApoBlOO 
5 and ApoB48 play different roles in lipid metabolism. Various studies indicated that 
ApoBlOO-associated lipoproteins (VLDL and LDL) are much more atherogenic than 
ApoB48-associated lipoproteins. 

Stimulating hepatic ApoB mRNA editing is a way to reduce serum LDL through the 
reduction in synthesis and secretion of ApoBlOO containing VLDL. In most mammals 

10 (including humans), ApoB mRNA editing is carried out only in the small intestine. Studies 
found that the presence of substantial editing in liver is associated with a less atherogenic 
lipoprotein profile compared with animals that do not have liver editing activity. APOBEC- 
1 is expressed in all tissues that carry out ApoB mRNA editing. Human liver does not 
express APOBEC-1 but it does express sufficient auxiliary proteins to complement 

15 exogenous APOBEC-1 in ApoB mRNA editing in transfected cells. Therefore, induction of 
editing in human liver reduces apoBlOO synthesis and thereby reduce the levels of 
circulating LDL cholesterol. 

Based on population studies the etiology of atherosclerosis falls into four causalities 
(i) hypercholesterolemia (described above), (ii) pro-inflamatory reactions, (iii) pro clotting 

20 ,and (iv) metabolic syndrome (Corestti, J.P. et al., (2003) Atherosclerosis 111, 351-8; Moss, 
A. J. (1999) Circulation 99, 2517-22, herein incorporated by reference in their entireties). 
Cytidine deaminase inhibitor-based compounds are indicated for the latter causality as it is 
intestinally derived B48 containing reminants of chylomicrons that are small and 
accumulate within the walls of arteries due to their heightened affinity and high abundance 

25 in the blood of approximately 25% of the total population at risk for atherogenic disease. 

Chylomicron reminants appear in blood maximally within 1-2 hours following a meal (as a 
consequence of intestinal absorption) and are present in proportion to the fat content of the 
meal. 

The small intestine from all mammals edits 80-100% of apoB mRNA produced 
30 (Backus, J.W. (1990) Biochem. Biophys. Res. Commun. 170, 513-8; Greeve, J. (2003) J. 
Lipid Res. 34, 1 367-83, herein incorporated by reference in their entireties) and secretes 
B48 chylomicrons assembled in response to the presence of dietary lipids into the lymphatic 
system. Transgenic apobec-1 gene knock-out mice do not edit apoB mRNA and as a 
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consequence assemble chylomicrons on apoBlOO protein. These mice have a normal 
phenotype (Farese, R.V. et al., (1996) PNAS USA 93, 6393-8; Nakamuta, M. et al., (1996) X 
Biol. Chem. 271, 25981-8, herein incorporated by reference in their entireties), but do have 
an elevated risk of atherosclerosis due to the lack of hepatic editing. This leads to hepatic 
5 apoBlOO VLDL secretion into the blood steam and subsequent metabolism to apoBlOO 
LDL. cytidine deaminase inhibitors selective for APOBEC-1 inhibits apoB mRNA editing 
within intestinal cells thereby resulting in exclusive secretion of apoBlOO containing 
chylomicrons that are metabolized to large particles that are cleared by the liver with 
reduced rates of artery wall accumulation. 

10 In addition to the established apoB mRNA editing activity of APOBEC-1, C to U 

modifications have been detected in the tumors of approximately one quarter of individuals 
afflicted with the disease neurofibromatosis type I (NF1) (Skuse, G.R., et al., (1996) 
Nucleic Acids Res, 24,478-85, herein incorporated by reference in its entirety). The 
translation product of NF1 mRNA is a tumor suppressor GTPase activating protein (GAP), 

15 neurofibromin, which inhibits cell growth signaling mediated by the GTPase dependent Ras 
oncogene product. Due to the presence of a mooring-sequence-like segment downstream of 
C3916 in the neurofibromin mRNA, this site can be edited converting an Arg to an in- frame 
translation stop codon. If the product of the edited mRNA escapes nonsense mediated 
decay, the result is a truncated protein missing the C-terminal GAP domain providing a 

20 potential mechanism for disease. APOBEC-1 mRNA editing of nuerofibromin mRNA 

appears to play a direct role in NF1 disease (Mukhopadhyay, D., et al., (2002) Am J Hum 
Genet, 70, 38-50, herein incorporated by reference in its entiretly). Due to the unique 
tripartite motif in NF1 mRNA editing of C3916 in NF1 mRNA can be selectively inhibited 
by cytidine deaminase inhibitors. This is an instance where in the same enzyme (APOBEC- 

25 1) can be targeted by two different cytidine deaminase inhibitors (one mimicing NF1 and 
the other apoB RNA) with the possibility of tissue specific effects due to the differential 
affinity and competition of the cytidine deaminase inhibitor with the endogenous mRNA 
substrate that is selectively expressed in each tissue (e.g. NF1 cytidine deaminase inhibitor 
inhibits APOBEC-1 editing only in neuronal tissue and apoB cytidine deaminase inhibitor 

30 inhibits editing only in liver and intestine). 

Unregulated transgenic expression of APOBEC-1 confers susceptibility to liver 
cancer (Sowden, M., (1996) J Biol Chem, 271,:301 1-7; Yamanaka, S., et al., (1995) PNAS 
USA, 92, 8483-7, herein incorporated by reference in its entirety), possibly as a result of 
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nonspecific mRNA editing affecting the NAT1 translational repressor. Another novel 
mechanism leading to neoplasia that can be related to the observation that APOBEC-1 - 
mediates stabilization of mRNA transcripts of c-myc (Anant, S. and Davidson, N.O. (2000) 
Mol Cell Biol, 20, 1982-92, herein incorporated by reference in its entirety). Although 
5 APOBEC-1 itself has marginal affinity for apoB mRNA, it has been demonstrated that it 
exhibits modest affinity for tandem AU-rich sequences. Disruption of this interaction by 
cytidine deaminase inhibitor compound can decrease c-myc stabilization, which could 
significantly impact the half-life of the proto oncogene product. 

Two loci of APOBEC-1 related proteins (ARPs) were discovered in humans, as well 

10 as related proteins in mice and yeast (Dance, G.S., et al., (2001) Nucleic Acids Res, 29, 

1772-80; Anant, S., et al., (2001) Am J Physiol Cell Physiol, 281, C1904-16; Jarmuz, A., et 
al., (2002) Genomics, 79, 285-96 all herein incorporated by reference in their entireties). A 
summary of human ARP proteins is provided in Table 1 and in Wedekind, J.E., et al., 
(2003) Trends in Genet, 19, 207-16. The cytidine deaminase inhibitors described herein can 

15 target the ARPs or their auxiliary proteins in a manner similar to APOBEC-1. For example, 
context dependent inhibitors applications are specifically indicated for metabolic syndrome, 
while apoB mRNA editing is indicated for treatment in the small intestine. 

The activation-induced cytidine deaminase (AID) (e.g. a novel ARP) is required for 
somatic hypermutation (SHM) of immunoglobulin (Ig) genes in germinal center (GC) B 

20 cells (Muramatsu, M., et al., (1999) J Biol Chem, 21 A, 18470-6; Muramatsu, M., et al., 
(2000) Cell, 102, 553-63). AID is also known to be deficient in human patients afflicted 
with hyper IgM syndrome type II (Revy, P., et al., (2000) Cell, 102, 565-75). Although 
AID displays significant homology to APOBEC-1, it can act directly on DNA (Petersen- 
Mahrt, S.K., (2000) Nature, 418, 99-103; Harris, R.S., et al., (2002) Mol Cell, 10, 1247-53, 

25 herein incorporated by reference in their entireties). In GC B cells, SHM is usually 
restricted to Ig genes, however, AID is constitutively expressed in human B cell 
malignancies such as diffuse large B cell lymphomas (DLBCL) and some chronic 
lymphocytic leukemias (CLL), as well as in other lymphomas of both germinal center (GC) 
and non-GC origin (Greeve, J., et al., (2003) Blood, 101, 3574-80; McCarthy, H., et al., 

30 (2003) Blood, 101, 4903-8; Oppezzo, P., et al., (2003) Blood, 101, 4029-32, herein 
incorporated by reference in their entireties). In CLL samples, AID expression was 
specifically associated with poor prognostic features. Paradoxically, poor prognosis in CLL 
correlates with the absence of Ig gene SHM (Damle, R.N., et al., (1997) Blood, 94, 1840-7; 
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Hamblin, T.J., et aL, (1999) Blood, 94, 1848-54 herein incorporated by reference in their 
entireties), showing that AID function is altered in these cells. AID expression in the 
absence of ongoing SHM is also observed in certain diffuse large B-cell lymphomas 
(DLBCLs), specifically those defined as "activated B cell-like" based upon gene expression 
5 profiles and surface markers (Lossos, LS., et aL, (2000) PNAS USA, 97, 10209-13, herein 
incorporated by reference in its entirety). Here too, a worse outcome is associated with the 
non-mutating phenotype compared to DLBCLs in which ongoing Ig SHM is detected, such 
as those of GC-like phenotype (Rosenwald, A., et aL, (2003) N Engl J Med, 346, 1937-47; 
Alizadeh, A. A., et aL, (2000) Nature, 403, 503-1 1, herein incorporated by reference in their 
10 entireties). 



Table 1: Genetic Location and Properties of human APOBEC-1 Related Proteins (ARPs) 


Gene 


Locus 


Accession 
Number 


Equivalent 
Names 


Expressed 
Sequence 


CtoU 
Activity 


APOBECA 


12pl3.1 


AAD00185 




Small & Large 
Intestine 


apoB mRNA 


APOBEC-2 


6p21 


NP_006780 


CAB44740/ARCD1 


Cardiac & 


— 








Skeletal Muscle 




AID 


12pl3 


NP 065712 




B lymphocytes 


DNA 


APOBEC-3A 


22ql3.1 


NP 663745 


Phorbolin-1 


Keratinocytes 




APOBEC-3& 


22ql3.1 


Q9UH17 


Phorbolin-3 
Phorbolin- 1 -related 
Phorbolin-2 
APOBEC-1 L 
ARCD-3 


Keratinocytes, 
colon 




APOBEC-3C 


22ql3.1 


CAB45271 


Phorbolin-1 
ARCD-2/ARCD-4 


Spleen, testes, 
heart, thymus, 
prostate, ovary, 
uterus, PBLs, 




APOBEC- 


22ql3.1 


NMJ45298 




uterus 




3D+E 












APOBEC-3D 


22ql3.1 


BF841711 




head and neck 
cancers 




APOBEC-3E 


22ql3.1 


Pseudogene 


ARCD-6 






APOBEC-3F 


22ql3.1 


BG 758984 


ARCD-5 


B lymphocytes 




APOBEC-3G 


22ql3.1 


NPJ)68594 


Phorbolin-like- 
protein, MDS019, 
HsCEM15 


Spleen, breast, 
heart, thymus, 
PBLs, colon, 
stomach, kidney, 
uterus, pancreas, 
placenta, prostate 


Viral 
DNA 


22ql3J 


22ql3.1 


XP 092919 








12q23 




XP 115170 









In subsets of DLBCL and CLL, AID expression is uncoupled from somatic 
hypermutation activity, a feature that correlates with more aggressive forms of these 
15 diseases. These characteristics show that AID function is aberrant in B cell cancers. In fact, 
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oncogene mutations with patterns resembling SHM have been found at high frequency in B 
cell lymphomas. One explanation for these data is that loss of target specificity during the 
SHM process may be involved in the transformation and/or progression of B lymphoid 
malignancies. Constitutive AID expression in transgenic mice was shown to cause T cell 
5 lymphomas and pulmonary adenomas, formally demonstrating AID's oncogenic potential. 
Finally, and most significantly, transgenic expression of AID under a ubiquitous promoter 
induces T cell lymphomas and pulmonary adenomas, accompanied by extensive mutations 
in the c-myc protooncogene, confirming that AID can act as a bona fide oncogene when 
ectopically expressed (Okazaki, I.M., et al., (2003) J Exp Med, 197, 1 173-81,). Hence, the 

10 oncogenic effect of AID appears to be attributable to loss of regulation over a "normal" 
DNA mutase activity, as a consequence of over-expression of AID isoforms with altered 
function, or defects in cofactors involved in determining specificity of SHM targeting. The 
consequence is genome-wide mutagenesis contributing to rapid accumulation of multiple 
oncogenic hits, resulting in accelerated tumor progression. Therefore, selectively disabling 

1 5 the deaminase activity of AID in neoplastic cells using a cytidine deaminase inhibitor is a 
useful therapeutic strategy. 

Another ARP of immediate medical relevance is hsCEM15/APOBEC-3G (Table 1), 
which has been described as a broad antiviral agent that reduces the infectivity of the human 
lentivirus, HIV-1 (Mangeat, B., et al., 2003, Nature 424, 99-103). The virus contains a 10- 

20 kb single-stranded, positive-sense RNA genome that encodes three major classes of gene 
products. One class known as "auxiliary" proteins (Vpr, Vif, Vpu, Nef) are not required for 
efficient virus replication in at least some settings in cell culture. One of these proteins, Vif 
(virion infectivity factor), is required for efficient virus replication in vivo, as well as in 
certain host cell types in vitro (Fisher, A.G., et al., (1987) Science, 237,:888-93; Strebel, K., 

25 et al., (1987) Nature, 328,:728-30), because of its ability to overcome the action of a cellular 
antiviral system (Simon, J.H., et al., (1998) Nat Med, 4, 1397-400; Madam, N. and D. 
Kabat, (1998) J Virol, 72, : 1025 1-5). 

The in vitro replicative phenotype of vif-deleted molecular clones of HIV-1 is 
strikingly different in vif-permissive cells (such as 293T cells, and the SUPT1 and CEM-SS 

30 T cell lines), as compared to vif-non-permissive cells (such as primary T cells and 

macrophages). In the former cells, vif-deleted HIV-1 clones replicate with an efficiency 
that is essentially identical to that of wild-type virus, whereas in the latter cells, replication 
of vif-negative HIV-1 mutants is arrested due to a failure to accumulate reverse transcripts 
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and to generate infectious proviral integrants in the host cell (Courcoul, M., et al., (1995) J 
Virol, 69, :2068-74; Simon, J.H. and M.H. Malim, (1996) J Virol, 70, 5297-305; von 
Schwedler, U., et al., (1993) J Virol, 67, 4945-55; Sova, P. and DJ. Volsky (1993) J Virol, 
67, 6322-6). These defects are due to the expression of host cell protein CEM15 in vif-non- 
5 permissive cells (Sheehy, A.M., et al., (2002) Nature, 418, 646-50). The instability of viral 
reverse transcripts in vif-non-permissive cells in the absence of Vif, and the fact that both 
Vif and CEM15 are present in HIV-l virions (Sheehy, A.M., et al., (2002) Nature, 418, 
646-50; Camaur, D. (1996) J Virol, 70, 6106-1 1; Liu, H., et al., (1995) J Virol, 69, 7630-8), 
strongly suggests that the CEM15 antiviral activity may be derived from effects on viral 

10 RNA or reverse transcripts (viral DNA). This HIV RNA/DNA modification hypothesis is 
the most likely explanation for the CEM15 effect on viral infectivity. Due to the "broad" 
anti- viral activity described for CEM15, cytidine deaminase inhibitors can be used in the 
context of retroviral-based expression of exogenous nucleic acids. 

The reason(s) for the modest success of retroviral mediated delivery of gene therapy 

15 (Chang, LJ. and E.E. Gay, (2001) Curr Gene Ther, 1, 237-51) can be at least in part due to 
the activity of enzymes like CEM15. It is likely that editing enzymes in general act to 
prevent retroviral infection through the body. ARPs and Adenosine Deaminase active on 
RNA (ADRA1 and ADAR2) are selectively expressed tissues and regions within tissues 
(Barbon , A. et al. (2003) Brain Res. Mol Brain Res. 1 17,168-78; Sowden et al., (2000) J. 

20 Cell Sci. 115, 1027-39, herein incorporated by reference in their entireties). Each enzyme 
requires a unique flanking sequence context for cytidine, deoxycytidine or adenosine 
editing. CEM15 (Zhang et al., (2003) Nature 424,94-8; Lecossier et al., Science 300, 1112) 
and ADAR1 (George, C.X. & Samuel, C.E. (1999) PNAS USA 96, 4621-26; Jayan, G.C. & 
Casey, J.I. (2002) J. Virol. 76, 12399-404, Casey, J.L. (2002) J. Virol. 76,7385-97, herein 

25 incorporated by reference in their entireties) use target sequence and RNA secondary 

structure to edit viral RNA and DNA. Inactivating CEM15, ADARs or other ARPs with 
cytidine deaminase inhibitors can be used to selectively inhibit editing in tissues where 
these enzymes are expressed while allowing enzymes that were not targeted with cytidine 
deaminase inhibitors to remain active in other tissues. Tissues in which cytidine deaminase 

30 inhibitor therapy inactivated editing enzymes are not be able to edit retrovirus and thereby 
become more susceptable to gene thereapy, whereas all other tissues are refractory to 
retroviral infection because their editing enzymes does not interact with the cytidine 
deaminase inhibitors and therefore remain active to modify and inactive the retroviral 
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genome and transgenes. Cytidine deaminase inhibitor and retrovirus vectors are used in 
combination to not only enhance the efficiency of gene delivery but also enable retorviral 
delivery which are used tissue specifically. 

The results of x-ray crystallographic structural studies and comparative modeling 
5 show that the overall molecular architecture and enzyme active sites of cytidine deaminases 
share common features that enable binding and activity on large nucleic acid substrates. 
Furthermore, these active sites are susceptible to inhibition by cytidine deaminase inhibitors 
that target all related enzymes indiscriminately. Therefore, embedding the cytidine 
deaminase inhibitor within specific polynucleotide target sequences preferred by each 
10 respective cytidine deaminase causes selective inhibition of the cytidine deaminase. In this 
context, it is possible to direct inhibition of only selected cytidine deaminase activities. The 
large size of the embedded cytidine deaminase inhibitor precludes interaction and 
cytotoxicity associated with many cytidine deaminases involved in pyrimidine metabolism. 

Definitions 

15 As used in the specification and the appended claims, the singular forms "a," "an" 

and "the" include plural referents unless the context clearly dictates otherwise. Thus, for 
example, reference to "an inhibitor" includes mixtures of the inhibitors. 

Ranges may be expressed herein as from "about" one particular value, and/or to 
"about" another particular value. When such a range is expressed, another embodiment 

20 includes from the one particular value and/or to the other particular value. Similarly, when 
values are expressed as approximations, by use of the antecedent "about," it will be 
understood that the particular value forms another embodiment. It will be further 
understood that the endpoints of each of the ranges are significant both in relation to the 
other endpoint, and independently of the other endpoint. 

25 In this specification and in the claims which follow, reference will be made to a 

number of terms which shall be defined to have the following meanings: 

"Optional" or "optionally" means that the subsequently described event or 
circumstance may or may not occur, and that the description includes instances where said 
event or circumstance occurs and instances where it does not. 

30 Ranges may be expressed herein as from "about" one particular value, and/or to 

"about" another particular value. When such a range is expressed, another aspect includes 
from the one particular value and/or to the other particular value. Similarly, when values 
are expressed as approximations, by use of the antecedent "about," it will be understood that 
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the particular value forms another aspect. It will be further understood that the endpoints of 
each of the ranges are significant both in relation to the other endpoint, and independently 
of the other endpoint. 

References in the specification and concluding claims to parts by weight of a 
5 particular element or component in a composition or article, denotes the weight relationship 
between the element or component and any other elements or components in the 
composition or article for which a part by weight is expressed. Thus, in a compound 
containing 2 parts by weight of component X and 5 parts by weight component Y, X and Y 
are present at a weight ratio of 2:5, and are present in such ratio regardless of whether 
10 additional components are contained in the compound. 

A weight percent of a component, unless specifically stated to the contrary, is based 
on the total weight of the formulation or composition in which the component is included. 

The terms "higher," "increases," "elevates," "enhances," or "elevation" refer to 
increases above basal levels, e.g., as compared to a control. The terms "low," "lower," 
15 "reduces," "suppresses" or "reduction" refer to decreases below basal levels, e.g., as 

compared to a control. For example, basal levels are normal in vivo levels prior to, or in the 
absence of, or addition of an agent such as a cytidine deaminase inhibitor or another 
molecule or ligand. 

The term "test compound" is defined as any compound to be tested for its ability 
20 (efficacy) to inhibit a cytidine deaminase molecule or a deoxycytidine deaminase molecule. 
Also, "test compounds" include drugs, molecules, and compounds that come from 
combinatorial libraries where thousands of such ligands are screened by drug class. 

By "subject" is meant an individual. Preferably, the subject is a mammal such as a 
primate, and, more preferably, a human. The term "subject" can include domesticated 
25 animals, such as cats, dogs, etc., livestock (e.g., cattle, horses, pigs, sheep, goats, etc.), and 
laboratory animals (e.g., mouse, rabbit, rat, guinea pig, etc.). 

The terms "control levels" or "control cells" are defined as the standard by which a 
change is measured, for example, the controls are not subjected to the experiment, but are 
instead subjected to a defined set of parameters, or the controls are based on pre- or post- 
30 treatment levels. 

By "contacting" is meant an instance of exposure by close physical contact of at 
least one substance to another substance. For example, contacting can include contacting a 
substance, such as a pharmacologic agent, with a cell. A cell can be contacted with a test 
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compound, for example, a cytidine deaminase inhibitor, or putative cytidine deaminase 
inhibitor by adding the agent to the culture medium (by continuous infusion, by bolus 
delivery, or by changing the medium to a medium that contains the agent) or by adding the 
agent to the extracellular fluid in vivo (by local delivery, systemic delivery, intravenous 
5 injection, bolus delivery, or continuous infusion). The duration of contact with a cell or 
group of cells is determined by the time the test compound is present at physiologically 
effective levels or at presumed physiologically effective levels in the medium or 
extracellular fluid bathing the cell. 

"Treatment" or "treating" means to administer a composition to a subject or a 

10 system with an undesired condition or at risk for the condition. The condition can include a 
disease or a predisposition to a disease. The effect of the administration of the composition 
to the subject can have the effect of but is not limited to reducing or preventing the 
symptoms of the condition, a reduction in the severity of the condition, or the complete 
ablation of the condition. 

1 5 By "effective amount" is meant a therapeutic amount needed to achieve the desired 

result or results, e.g., inhibiting editing of nucleic acids, interrupting selective cytidine 
deaminase activity, reducing viral production or infectivity, inhibiting class switch 
recombination, inhibiting somatic hypermutation, enhancing or blunting physiological 
functions, enhancing viral vector therapy, altering the qualitative or quantitative nature of 

20 the proteins expressed by cell or tissues, and eliminating or reducing disease causing 
molecules and/or the mRNA or DNA that encodes them, etc. 

Herein, "inhibition" or "suppression" means to reduce activity as compared to a 
control (e.g., basal activity in the absence of such inhibition). It is understood that 
inhibition or suppression can mean a slight reduction in activity to the complete ablation of 

25 all activity. An "inhibitor" or "suppressor" can be anything that reduces the targeted 

activity. For example, suppression of CEM15 by a cytidine deaminase inhibitor can be 
determined by assaying the amount of CEM15 activity in the presence of the cytidine 
deaminase inhibitor to the amount of CEM15 activity in the absence of the cytidine 
deaminase inhibitor. In this example, if the amount of CEM15 activity is decreased in the 

30 presence of the context dependent inhibitor as compared to the amount of CEM15 activity 
in the absence of the context dependent inhibitor, the cytidine deaminase inhibitor can be 
said to suppress CEM15 activity. 
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Methods disclosed herein may refer to "systems." It is understood that systems can 
be, for example, cells, columns, or batch processing containers (e.g., culture plates). A 
system is a set of components, any set of components that allows for the steps of the method 
to performed. Typically a system will comprise one or more components, such as a 
5 protein(s) or reagent(s). One type of system disclosed would be a cell that comprises both a 
cytidine deaminase and a cytidine deaminase inhibitor, for example. Another type of 
system would be one that comprises a cell (e.g., a cancer cell). A third type of system might 
be a chromatography column that has CEM15, AID, or other deaminase or putative 
deaminase, bound to the column. 

10 "Context dependent inhibitor" or cytidine deaminase inhibitor means any inhibitor 

found within the context of a polymeric substrate that has a targeting function. The 
polymeric substrate can be a nucleic acid, but it is not limited to such molecules. 

A "cytidine deaminase-positive cell" means any cell that expresses one or more 
cytidine deaminases or deoxycytidine deaminases. Such expression can be naturally 

1 5 occurring or the cell can include an exogenous nucleic acid that encodes one or more 
selected deaminases. 

By "polymeric substrate" is meant a nucleic acid sequence into which the cytidine 
deaminase inhibitor has been incorporated. The polymeric substrate of the cytidine 
deaminase inhibitor can comprise an oligonucleotide or a polynucleotide, for example. The 

20 nucleic acid sequence can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
21, 22, 23, ,24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 
45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 
69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 
93, 94, 95, 96, 97, 98, 99, or 100 or more residues in length. Also, the nucleic acid sequence 

25 can be an RNA or a DNA sequence, can comprise naturally occurring or non-naturally 
occurring nucleotides, and can comprise double or single stranded sequences. A double 
stranded sequence can comprise a single stranded region such as a replication bubble, but is 
not limited to such duplex structures. Double-stranded need not imply Watson-Crick base 
pairing interactions as the sole feature that holds the two-stranded molecule together (e.g. 

30 there can be non-base-paired as well as base paired regions etc). 

Variables such as R^R 10 , W, X 1 , X 2 , Y, and Z used throughout the application are 
the same variables as previously defined unless stated to the contrary. 
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The term "alkyl group" as used herein is a branched or unbranched saturated 
hydrocarbon group of 1 to 24 carbon atoms, such as methyl, ethyl, «-propyl, isopropyl, 
w-butyl, isobutyl, J-butyl, pentyl, hexyl, heptyl, octyl, decyl, tetradecyl, hexadecyl, eicosyl, 
tetracosyl and the like. A "lower alkyl" group is an alkyl group containing from one to six 
5 carbon atoms. 

The term "alkenyl group" is defined as a hydrocarbon group of 2 to 24 carbon atoms 
and structural formula containing at least one carbon-carbon double bond. 

The term "alkynyl group" is defined as a hydrocarbon group of 2 to 24 carbon atoms 
and a structural formula containing at least one carbon-carbon triple bond. 
10 The term "halogenated alkyl group" is defined as an alkyl, alkenyl, or alkynyl group 

as defined above with one or more hydrogen atoms present on these groups substituted with 
a halogen (F, CI, Br, I). 

The term "cycloalkyl group" is defined as a non-aromatic carbon-based ring 
composed of at least three carbon atoms. Examples of cycloalkyl groups include, but are 
15 not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, etc. The term 

"heterocycloalkyl group" is a cycloalkyl group as defined above where at least one of the 
carbon atoms of the ring is substituted with a heteroatom such as, but not limited to, 
nitrogen, oxygen, sulphur, or phosphorous. 

The term "aryl group" as used herein is any carbon-based aromatic group including, 
20 but not limited to, benzene, naphthalene, etc. The term "aromatic" also includes "heteroaryl 
group," which is defined as an aromatic group that has at least one heteroatom incorporated 
within the ring of the aromatic group. Examples of heteroatoms include, but are not limited 
to, nitrogen, oxygen, sulfur, and phosphorus. The aryl group can be substituted or 
unsubstituted. The aryl group can be substituted with one or more groups including, but not 
25 limited to, alkyl, alkynyl, alkenyl, aryl, halide, nitro, amino, ester, ketone, aldehyde, 
hydroxy, carboxylic acid, or alkoxy. 

The term "aralkyl" is defined as an aryl group having an alkyl, alkynyl, or alkenyl 
group as defined above attached to the aromatic group. An example of an aralkyl group is a 
benzyl group. 

30 The term "amine group" is represented by the formula -NRR', where R and R' can 

be, independently, hydrogen, an alkyl, alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, 
halogenated alkyl, or heterocycloalkyl group described above. 

The term "amide group" is represented by the formula -C(0)NRR', where R and R' 
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can be hydrogen, an alkyl, alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated alkyl, or 
heterocycloalkyl group described above. 

The term "ester" is represented by the formula -OC(0)R, where R can be an alkyl, 
alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated alkyl, or heterocycloalkyl group 
5 described above. 

The term "hydroxyl group" is represented by the formula -OH. The term "alkoxy 
group" is represented by the formula -OR, where R can be an alkyl, alkenyl, alkynyl, aryl, 
aralkyl, cycloalkyl, halogenated alkyl, or heterocycloalkyl group described above. 

The term "aldehyde" is represented by the formula -C(0)H. 
10 The term "keto group" is represented by the formula -C(0)R, where R is an alkyl, 

alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated alkyl, or heterocycloalkyl group 
described above. 

The term "acyl" is represented by the formula -OC(0)R, where R is hydrogen, an 
alkyl, alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated alkyl, or heterocycloalkyl 
1 5 group described above. 

The term "sulfone" is represented by the formula -S(0) 2 R, where R is hydrogen, an 
alkyl, alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated alkyl, or heterocycloalkyl 
group described above. 

The term "halo" is defined as F, CI, Br, or L 
20 The term "monophosphate" (or monophosphoryl) is represented by the formula 

(RO) 2 (0)PO-, where each R can be, independently, hydrogen, an alkyl, alkenyl, alkynyl, 
aryl, aralkyl, cycloalkyl, halogenated alkyl, heterocycloalkyl group as described above, or 
the salt thereof. 

The term "diphosphate" (or diphosphoryl) is a double phosphoanhydride bond 
25 represented by the formula (RO) 2 (0)P-0-P(0)(OR)0-, where each R can be, independently, 
hydrogen, an alkyl, alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated alkyl, 
heterocycloalkyl group as described above, or the salt thereof. 

The term "triphosphate"(or triphosphoryl) is a triple phosphoanyhdride bond 
represented by the formula (RO) 2 (0)P-0-P(0)(OR)0-P(0)(OR)0-, where each R can be, 
30 independently, hydrogen, an alkyl, alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated 
alkyl, heterocycloalkyl group as described above, or the salt thereof. 

The term "phosphorothioate" is a P-S bond represented by the formula 
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(RO)2(SR)PO-, where S is a sulfur (and can be a thiolate), and each R can be, 
independently, hydrogen, an alkyl, alkenyl, alkynyl, aryl, aralkyl, cycloalkyl, halogenated 
alkyl, heterocycloalkyl group as described above, or the salt thereof. 

The term "phosphoroamidate" is a P-N bond represented by the formula 
5 (RO)2CNRR')PO-, where each R and R' can be, independently, hydrogen, an alkyl, alkenyl, 
alkynyl, aryl, aralkyl, cycloalkyl, halogenated alkyl, heterocycloalkyl group as described 
above, or the salt thereof. 

The term "phosphonate" is a P-C bond represented by the formula (RO) 2 (CRR')PO-, 
where each R and R' can be, independently, hydrogen, an alkyl, alkenyl, alkynyl, aryl, 
10 aralkyl, cycloalkyl, halogenated alkyl, heterocycloalkyl group as described above, or the salt 
thereof 

A 2'-5' (2'-to-5') phosphodiester linkage is a phosphoanhydride linkage between the 
2'-oxygen of the nucleotide ribose sugar and the 5 '-phosphorus of the flanking nucleotide. 
This bond is chemically distant from the typical 3 '-5' phosphodiester bond observed in 

15 cellular (natural) DNA or RNA synthesis via polymerase enzymes. The linkage is resistant 
to cellular degradation enzymes. 

R ! -R 10 , W, X 1 , X 2 , Y, andZ can, independently, possess two or more of the groups 
listed above. For example, if R 1 is a straight chain alkyl group, one of the hydrogen atoms 
of the alkyl group can be substituted with a hydroxyl group, an alkoxy group, etc. 

20 Depending upon the groups that are selected, a first group may be incorporated within 

second group or, alternatively, the first group may be pendant (i.e., attached) to the second 
group. For example, with the phrase "an alkyl group comprising an ester group," the ester 
group may be incorporated within the backbone of alkyl group. Alternatively, the ester can 
be attached to the backbone of the alkyl group. The nature of the group(s) that is (are) 

25 selected will determine if the first group is embedded or attached to the second group. 

Disclosed are compounds, compositions, and components that can be used for, can 
be used in conjunction with, can be used in preparation of, or are products of the disclosed 
methods and compositions. These and other materials are disclosed herein, and it is 
understood that when combinations, subsets, interactions, groups, etc. of these materials are 

30 disclosed that while specific reference of each various individual and collective 

combinations and permutation of these compounds may not be explicitly disclosed, each is 
specifically contemplated and described herein. For example, if a number of different 
nucleosides and polymeric substrates are disclosed and discussed, each and every 
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combination and permutation of the nucleoside and the polymeric substrate are specifically 
contemplated unless specifically indicated to the contrary. Thus, if a class of molecules A, 
B, and C are disclosed as well as a class of molecules D, E, and F, and an example of a 
combination molecule, A-D is disclosed, then even if each is not individually recited, each 
5 is individually and collectively contemplated. Thus, in this example, each of the 

combinations A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are specifically contemplated 
and should be considered disclosed from disclosure of A, B, and C; D, E, and F; and the 
example combination A-D. Likewise, any subset or combination of these is also 
specifically contemplated and disclosed. Thus, for example, the sub-group of A-E, B-F, and 

10 C-E are specifically contemplated and should be considered disclosed from disclosure of A, 
B, and C; D, E, and F; and the example combination A-D. This concept applies to all 
aspects of this disclosure including, but not limited to, steps in methods of making and 
using the disclosed compositions. Thus, if there are a variety of additional steps that can be 
performed it is understood that each of these additional steps can be performed with any 

15 specific embodiment or combination of embodiments of the disclosed methods, and that 
each such combination is specifically contemplated and should be considered disclosed. 
Compounds and Compositions 

Cytidine deaminases are a class of enzymes involved in pyrimidine metabolism. 
The mechanism for cytidine deaminase activity involves nucelophilic attack at the C4 

20 position of the cytosine ring by water coordinated to Zn 2+ , where the zinc ion is bound to 
the enzyme. Thus, any nucleoside susceptible to nucleophilic attack or one that mimics the 
tetrahedral sp 3 geometry of the intermediate (but need not be chemically reactive) can be 
used in any of the methods described herein as a component of a cytidine deaminase 
inhibitor. In one aspect, one or more cytidine deaminase inhibitors can be used in the 

25 polymeric substrate to inhibit cytidine deaminases. 

The C4 position of cytosine is a sp 2 carbon center, which is susceptible to 
nucleophilic attack by the enzyme. In one aspect, the cytidine deaminase inhibitor used 
herein is a nucleoside comprising a sp 2 carbon center susceptible to nucleophilic attack 
giving rise to the enzyme's preferred sp 3 intermediate. Methods for making a compound 

30 susceptible to nucleophilic attack and those whose geometry are preferred by the enzyme 
are known in the art. For example, attaching an electron-withdrawing group on the atom 
where nucleophilic attack is desired can be performed. As used herein, the term "electron- 
withdrawing group" is defined as any group that makes a compound more susceptible to 
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nucleophilic when compared to the same compound that does not contain the electron- 
withdrawing group. In one aspect, the cytidine deaminase inhibitor is a nucleoside 
comprising a sp 2 carbon center with an electron-withdrawing group attached to the sp 2 
carbon center. Examples of electron-withdrawing groups include, but are not limited to, a 
5 nitro group, a cyano group, an ester group, an aldehyde group, a keto group, a sulfone 

group, a carbonyl group, an imino group, an alkenyl, an amide group, a hydroxyl group, or a 
combination thereof. Two or more electron- withdrawing groups can be used in 
combination with one another. For example, the electron-withdrawing group can be an enol 
represented by the formula -C=C-OH, which is composed of an alkenyl group and a 

10 hydroxyl group. In this example, the enol can tautomerize to the corresponding keto 
compound -HC-C(O). 

As described above, in one aspect, the cytidine deaminase inhibitor of cytidine 
deaminases described herein are susceptible to nucleophilic attack. In one aspect, the 
Cytidine deaminase inhibitor of cytidine deaminases are susceptible to nucleophilic attack 

15 by a metal-bound hydroxide such as, for example, zinc hydroxide. 

In one aspect, the cytidine deaminase inhibitor of cytidine deaminases (or its family 
members the adenososine deaminases acting on RNA or ADARs, including those with 
activity on mRNA, tRNA or rRNA) used herein comprise a purine or pyrimidine. The 
terms "purine" and "pyrimidine" include any natural and non-natural purines and 

20 pyrimidines. When the Cytidine deaminase inhibitor of cytidine deaminases comprise a 
pyrimidine, nucleophilic attack can occur at the C4 position by the cytidine deaminase. 
Likewise, when the cytidine deaminase inhibitor of cytidine deaminases comprise a purine, 
nucleophilic attack can occur at the C6 position of the purine by the cytidine deaminase. In 
order to enhance nucleophilic attack at these positions, one or more electron-withdrawing 

25 groups can be placed at the C4 position of the pyrimidine or C6 position of the purine. 
However, the cytidine deaminase inhibitor can function by simply mimicking the sp3 
intermediate at the C4 or C6 positions of pyrmidine or purine bases. 

In one aspect, any of the nucleosides described herein are provided as the indicated 
enantiomer and substantially in the absence of its corresponding enantiomer (i.e., in 

30 enantiomerically enriched form). As used herein, the term "enantiomerically pure" refers to 
a nucleoside composition that includes at least approximately 95%, and preferably 
approximately 97%, 98%, 99%, or 100% of a single enantiomer of that nucleoside. In one 
aspect, the nucleoside is the naturally occurring D-enantiomer or the L-enantiomer. 
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In one aspect, the cytidine deaminase inhibitor of cytidine deaminases includes a 
nucleoside having the formula I or II 



R\R 2 ,and R 3 can be, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, acyl, 
aryl, aralkyl, monophosphate, diphosphate, triphosphate , phosphorothioate, 
phosphoramidate, phosphonate, or a 2 '-5' phosphodi ester linkage, N 3 , NR 4 R 5 , N0 2 , 
NOR 6 , CN, ~C(0)NH 2 , SH, -S-alkyl, -S-aryl, Se-alkyl, Se-aryl, or a residue of the 
polymeric substrate, wherein at least one of R 2 or R 3 is hydroxyl; 
X*,X 2 , andY can be, independently, hydrogen, hydroxyl, alkoxy, alkyl, acyl, aryl, 
aralkyl, NR 4 R 5 , or an electron-withdrawing group; 

wherein R 4 , R 5 , and R 6 can be, independently, alkyl, aryl, aralkyl, alkaryl, acyl, or 
hydrogen; and 

Z can be nitrogen or CR 7 , wherein R 7 is hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, or aralkyl. 

Any of the Cytidine deaminase inhibitors described herein can be incorporated in the 
polymeric substrate. By "incorporated into" is meant bracketed by (1) the polymeric 
substrate, (2) at either terminus (e.g. 5' or 3') of the polymeric substrate, or (3) a 
combination thereof. In one aspect, when the context-dependent cytidine deaminase 
inhibitor having the formula I or II is bracketed by the polymeric substrate, R 1 and R 2 are a 
residue of the polymeric substrate. In another aspect, when the cytidine deaminase inhibitor 
having the formula I or II is the terminus of the polymeric substrate, R 1 or R 2 is a residue of 




I 



II 



wherein 
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the polymeric substrate. The term "residue" as used in the specification and claims, refers 
to any moiety of the polymeric substrate including the moiety attached to any of the 
nucleosides described herein after the nucleoside has been incorporated (e.g., within the 
polymeric substrate or at the terminus or both) into the polymeric substrate. 
5 In one aspect, X in formulae I and II is any of the electron-withdrawing groups 

described above. In one aspect, when the nucleoside is formula I, R 2 is hydroxyl and R 3 can 
be hydroxyl, halo, or alkoxy. In another aspect, when the nucleoside has the formula I, X 1 
can be halo such as fluoro, chloro, bromo, or iodo, and X 2 is hydroxyl. In a further aspect, 
when the nucleoside is I, Z is CH. The nuclecoside having the formula I can have any 

10 combination of these aspects. 

In another aspect, when the nucleoside has formula II, R 2 is hydroxyl and R 3 can be 
hydroxyl, halo, or alkoxy. In a further aspect, when the nucleoside has the formula II, X 1 
can be hydrogen, hydroxyl, or NR 4 R 5 . In another aspect, when the nucleoside has the 
formula II, Y is hydrogen. In yet another aspect, when the nucleoside has the formula II, Z 

1 5 can be nitrogen or CR 7 , wherein R 7 can be hydrogen or hydroxyl. When R 7 is hydroxyl, the 
resulting enol can tautomerize to the keto form. The nuclecoside having the formula II can 
have any combination of these aspects. 

In another aspect, any of the nucleosides described herein has at least one group 
capable of hydrogen bonding in a manner similar to cytosine or uracil. In one aspect, R 2 

20 and /or R 3 in formulae I- VII, where formulae III- VII are discussed below, is a hydroxyl 
group. Nucleosides having at least one group capable of hydrogen bonding increases the 
nucleoside's affinity for the cytidine deaminase. 

Mechanistic studies have indicated that cytidine deaminases exhibit greater affinity 
for the tetrahedral intermediate (sp ) than the corresponding sp center prior to nucleophilic 

25 attack. In one aspect, the nucleosides described herein have at least one stable sp center 
that mimics the intermediate at C4 or C6 of pyrimidine or purine. In a related aspect, a 
compound such as cacodylic acid also known as dimethyl arsenic acid ((CHb^-As-ChH) 
mimics the sp 3 geometry and can serve as a cytidine deaminase inhibitor as long as it is 
tethered to the flanking sequences at 5' and 3' termini. The group at C4 need not be carbon, 

30 and can be replaced by As in the latter instance. In another aspect, the nucleoside has at 
least one sp 3 carbon center, wherein the sp 3 carbon center has at least one -WR group, 
wherein W is O, S, Se, or NR, wherein R is hydrogen, alkyl, aryl, aralkyl, alkenyl, alkynyl, 
or acyl as described above. In a further aspect, when the nucleoside includes a pyrimidine 
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or purine, the nucleoside comprises a sp 3 carbon center at C4 of the pyrimidine or C6 of the 



purine. 



In one aspect, the cytidine deaminase inhibitor includes a nucleoside having the 
formula III or IV 



R 1 , R 2 , and R 3 can be, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, acyl, 
aryl, aralkyl, monophosphate, diphosphate, triphosphate , phosphorothioate, 
phosphoramidate, phosphonate, or a 2'-5' phosphodiester linkage, N 3 , NR 4 R 5 , NO2, 
NOR 6 , CN, -C(0)NH 2 , SH, -S-alkyl, -S-aryl, Se-alkyl, Se-aryl, or a residue of the 
polymeric substrate, wherein at least one of R 2 or R 3 is hydroxyl; 
X 1 , X 2 , and Y can be, independently, hydrogen, hydroxyl, alkoxy, alkyl, acyl, aryl, 
aralkyl, NR 4 R 5 , or an electron-withdrawing group; 
W can be O, S, Se, or NR 10 ; 

wherein R 4 , R 5 , R 6 , R 8 , R 9 , and R 10 can be, independently, alkyl, aryl, aralkyl, 
alkaryl, acyl, or hydrogen; and 

Z can be nitrogen or CR 7 , wherein R 7 is hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, or aralkyl. 

In one aspect, X 1 and X 2 in formulae III and IV can be any of the electron- 
withdrawing groups described above. In one aspect, when the nucleoside is formula III, R 2 
is hydroxyl and R 3 can be hydroxyl, halo, or alkoxy. In another aspect, when the nucleoside 
has the formulae III or IV, X 1 can be halo such as fluoro, chloro, bromo, or iodo, and X 2 is 




III 



IV 



wherein 
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hydroxyl. In a further aspect, when the nucleoside is III, Z is CH. The nuclecoside having 

the formulae III or IV can have any combination of these aspects. 

Examples of nucleosides useful in the methods described herein include, but are not 

limited to, any of the phosphoramidites (distinct from phosphoramidates) manufactured by 

5 Glen Research Corporation. In one aspect, the nucleoside can be 

5 f -dimethoxytrityl-N-acetyl-2-0-acetyl-cytisine arabinoside,3 l -[(2-cyanoethyl)-(N,N- 
diisopropyl)]-phosphoramidite 

5 , -dimethoxytrityl--N-benzoyl-adenosine,2'-0-methyl-3 , -[(2-cyanoethyl)-(N,N- 
1 0 diisopropyl)]-phosphoramidite 

S'-dimethoxytrityl-N-benzoyl-cytidine 
phosphoramidite 

15 5-dimethoxytrityl-N-acetyl-cytidm^^ 
phosphoramidite 

S'-dimethoxytrityl-N-dimethylaminomethylidene-guanosine, 2'-0-methyl-3'-[(2- 
cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

20 

5 -dimethoxytrityl-uridine,2'-0-M 

5 f -dimethoxytrityl-N-phenoxyacety^^ 
diisopropyl)]-phosphoramidite 

25 

5 -dimethoxytrityl-N2-isopropylphenoxyacetyl-2 l -0-methyl-guanosine,3 , -[(2-cyanoethyl)- 
(N,N-diisopropyl)]-phosphoramidite 

5 , -dimethoxytrityl-N-benzoyl-2'-adenosine, 2'-0-methyl-3 , -succinoyl-long chain 
30 alkylamino-CPG 

S'-dimethoxytrityl-N-benzoyl^'-cytidine, 2'-0-methyl-3 , -succinoyl-long chain alkylamino- 
CPG 

35 S'-dimethoxytrityl-N-acetyl-cytidine^^O-methyl-S^succinoyl-long chain alkylamino-CPG 

S'-dimethoxytrityl-N-dimethylformamidine-guanosine^^O-methyl-S'-succinoyl-long chain 
alkylamino-CPG 

40 S'-dimethoxytrityl-uridine^'-O-methyl-S'-succinoyl-long chain alkylamino-CPG 
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N-(6-(0-dimethoxytrityl)-hexyl)-(2-carboxamide)-phthalimidyl-lcaa-CP 

S-dimethoxytrityl-N-acetyl-adenosine^'-O-acetyl— 3 f -succinoyl-long chain alkylamino- 
CPG 

5 

S'-dimethoxytrityl-N-acetyl-cytidine^'-acetate-S'-succinoyl-long chain alkylamino-CPG 

S-dimethoxytrityl-N-acetyl-guanosine^'-O-acetyl-S'-succinoyl-long chain alkylamino-CPG 

10 5'-dimethoxytrityl-inosine,2'-0-triisopropy^ 
diisopropyl)]-phosphoramidite 

5 f -dimethoxytrityl-4-(2-cyanoethylthio)-uri 
cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

15 

S-dimethoxytrityl-S-Methyl-uridine^'^ 
(N,N-diisopropyl)]-phosphoramidite 

5'-dimethoxytrityl-N-acetyl-5-methyl-cytid^ 
20 cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

S'-dimethoxytrityl-N-acetyl^'-deoxypurine riboside, 2 f -0-triisopropylsilyloxymethyl-3-[(2- 
cyanoethyl)-(N ? N-diisopropyl)]- phosphoramidite 

25 5 , -dimethoxytrityl-N2-(diacetyl)-2,6-diaminopurineriboside 5 2'-0- 
triisopropylsilyloxymethyl-3'-[(2-cyanoethyl)^ 

5 '-dimethoxytri tyl-5 -bromo-uridine,^ 
(N,N-diisopropyl)]-phosphoramidite 

30 

5 -dimethoxytrityl-5-iodo-uridine,2 , -0~triisopropylsilyloxymethyl-3 , -[(2-cyanoet^ 
diisopropy 1)] -phosphoramidite 

5 -dimethoxytrityl-[6-methyl-pyrrolo-[2,3-d]-pyrimidine-2(3H)-one]-2 , -0- 
35 triisopropylsilyloxymethyl-3 , -[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

5 , -0-methyl-2'-deoxythymidine,3 , -[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

5 , ~monomethoxytritylamino-2 , -deoxythymidine,3'-[(2-cyanoethyl)-(N,N-diisopropyl)]- 
40 phosphoramidite 

S'-dimethoxytrityl-N-benzoyl-S'-deoxyadenosine^'-succinoyl-long chain alkylamino-CPG 
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S'-dimethoxytrityl-N-benzoyl-S'-deoxycytosine, 2'-succinoyl-long chain alkylamino-CPG 
500 

5 S'-dimethoxytrityl-N-dimethylfon^ chain 
alkylamino-CPG 

S'-dimethoxytrityl-S-deoxythymidine^'-succinoyl-long chain alkylamino-CPG 

10 5 -dimethoxytrityl-N-succinoyl-long chain alkylamino-CPG ,2 f ,3'-deoxyCytosine 

N6-diisobutylaminomethylidene-2\3 -dideoxyadenosine,5 , -[(2-cyanoethyl)-(N,N- 
diisopropyl)]-phosphoramidite 

15 N4-diisobutylaminomethylidene-2 , ,3 t -dideoxycytidine,5 , -[(2-cyanoethyl)-(N,N- 
diisopropyl)] -phosphoramidite 

N2-dimethylaminomethylidene-2 , ,3'-dideoxyguanosine,5 ? -[(2-cyanoethyl)-(N > N- 
diisopropyl)]-phosphoramidite 

20 

2^3'-dideoxythymidine 3 5'-[(2-cyanoethyl)-(N ? N-diisopropyl)]-phosphoramidite 

5 , -dimethoxytrityl-N-phenoxyacetyl-2'-deoxyadenosine,3-[(2-cyanoethyl)-CN,N- 
diisopropyl)] -phosphoramidite 

25 

5'-dimethoxytrityl-N-acetyl-2 f -deoxycyt^^ 
phosphoramidite 

S'-dimethoxytxityl-N-p-isopropyl-ph 
30 diisopropyl)] -phosphoramidite 

5 , -dimethoxytrityl-N-benzoyl-5-methyl-2 , -deoxycytidine,2'-succinoyl- long chain 
alkylamino-CPG 500 

35 5 , -dimethoxytrityl-N2 ? N6-bis(diisobutylaminomethylidene)-2,6-diamino-2 , -deoxypurine 
riboside-3 f -[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

5 , -dimethoxytrityl-N4-ethyl-2 f -deoxycytidine 5 3'-[(2-cyanoethyl)-(TS[,N-diisopropyl)]- 
phosphoramidite 

40 

5 , -dimethoxytrityl-N-benzoyl-2*-deoxyadenosine,3 , -[(niethyl)-(N,N-diisopropyl)]- 
phosphonamidite 
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5 , -dimethoxytrityl-N--isobutyiyl-2 , -deoxycytidine ? 3 f -[(methy 
phosphonamidite 

5 5 , -dimethoxytrityl-N4-acetyl-2 , -deoxycytidine3 , ~[(methyl)-(N 
phosphonamidite 

5'-dimethoxytrityl-N-isobutyryl^ 
phosphonamidite 

10 

S^dimethoxytrityl^'-deoxythymidin^ 

5'-dimethoxytrityl-N-phenoxyacetyl-2 , -deoxyadenosine,3'~[methyl-CN,N-diisopropyl)]- 
phosphoramidite 

15 

5 , -dimethoxytrityl-N-acetyl-2 , -deoxycytidine,3 , -[methyl-(N,N-diisopropyl)]- 
phosphoramidite 

5 , -dimethoxytrityl-N-p-isopropyl-phenoxyacetyl-guanosine,3 ? -[niethyl-(N,N-diisopropyl)]- 
20 phosphoramidite 

5'-dimethoxytrityl-2 f -deoxythymidi 

5 , -dimethoxytrityl-N-benzoyl-2 , -deoxyadenosine,3 -H-phosphonate, TEA salt 

25 

5 ? -dimethoxytrityl-N-benzoyl-2'~deoxycytidine ) 3 , -H-phosphonate 5 DBU salt 
5 , -dimethoxytrityl-N-isobutyryl-2 , -deoxyguanosine,3 , -H-phosphonate, TEA salt 

30 5 , -dimethoxytrityl-2 l -deoxythymidine,3 , -H-phosphonate, TEA salt 

5'-Dimethoxytrityl-5-methyl-pyrimidin-2-one-2 , -deoxyriboside, 3-[(2-cyanoethyl)-(N,N- 
diisopropyl)]-phosphoramidite (5-Me-2 , -deoxyZebularine) 

5'-Dimethoxytrityl-2'-deoxyNebularine,3 , -[(2-cyanoethyl)-(N,N-diisopropyl)]- 
35 phosphoramidite (5-Me-2'-deoxyNebularine) 

5'-Dimethoxytrityl-N-acetyl-deoxyCytidine,2 , -fluoro-3'-[(2-cyanoethyl)- 
(N,N-diisopropyl)]-phosphoramidite 

40 5 , -Dimethoxytrityl-deoxyUridine,2 , -fluoro-3 , ~[(2-cyanoethyl)- 
(N,N-diisopropyl)]-phosphoramidite 
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5 -Dimethoxytrityl-Uridin^ 
(N ? N-diisopropyl)]-phosphoramidite 

5 f -Dimethoxytrityl-N-benzoyl-3-deoxyAdenosine,2 , -[(2-cyanoethyl)- 
5 (N,N-diisopropyl)] -phosphoramidite 

5 -Dimethoxytrityl-N-benzoyl-3 f -deoxyCytidine 5 3 , -[(2-cyanoethyl)- 
(N,N-diisopropyl)]-phosphoramidite 

10 

S'-Dimethoxylxityl-N-dimethylformamidine-S'-deoxyGuanosine, 
3 , -[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

5 ? -Dimethoxytrityl-3'-deoxyThymidine > 2 , -[(2-cyanoethyl)- 
1 5 (N,N-diisopropyl)] -phosphoramidite 

5 , -Dimethoxytrityl-N-acetyl-2 , -0-acetyl-Cytisine Arabinoside, 
3-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite 

20 In another aspect, the nucleoside can be zebularine, 5-methylzebularine, 4- 

fluorozebularine, 2'-deoxy, 2'-fluorozebularine, nebularine, 8-azanebularine, coformycin, 
3,4,5,6-tetrahydro-2'-deoxyuridine, 5-fluorozebularine, diazepinone riboside, or inosine. 

Also described herein are nucleosides that, when incorportated into a polymeric 
substrate, can facilitate the inhibition of cytidine deaminases. In one aspect, nucleoside has 

25 the formula V 



X 1 




V 



wherein 
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R 1 , R 2 , and R 3 can be, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, acyl, 
aryl, aralkyl, monophosphate, diphosphate, triphosphate , phosphorothioate, 
phosphoramidate, phosphonate, or a 2'-5' phosphodiester linkage, N 3 , NR 4 R 5 , NO2, 
NOR 6 , CN, -C(0)NH 2 , SH, -S-alkyl, -S-aryl, Se-alkyl, or Se-aryl, wherein at least 
one of R 2 or R 3 is hydroxyl; 

wherein R 4 , R 5 , and R 6 can be, independently, alkyl, aryl, aralkyl, alkaryl, acyl, or 
hydrogen; 

X 1 and X 2 can be electron-withdrawing groups, and 

Z can be nitrogen or CR 7 , wherein R 7 can be hydrogen, hydroxyl, alkoxy, halo, 

alkyl, acyl, aryl, or aralkyl, 
wherein the nucleoside is not uracil, thymine, cytosine, zebularine. 

Any of the electron-withdrawing groups described above can be used in formula V. 
In one aspect, X 1 and X 2 can be a nitro group, a halo group such as fluoro, chloro, bromo, or 
iodo, a cyano group, an ester group, an aldehyde group, a keto group, a sulfone group, an 
amide group, an imino group, an alkenyl group, a hydroxyl group, or a combination thereof. 
In another aspect, R 1 and R 2 are hydroxyl, and R 3 can be hydroxyl, halo, or alkoxy. In a 
further aspect, Z is CH. The nuclecoside having the formula V can have any combination 
of these aspects. In one aspect, the nucleoside having the formula V is 4-fluorozebularine 
or2'-deoxy, 2'-fluorozebularine. 

In another aspect, the nucleoside has the formula VI or VII 
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R 1 , R 2 , and R 3 can be, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, acyl, 
aryl, aralkyl, monophosphate, diphosphate, triphosphate , phosphorothioate, 
phosphoramidate, phosphonate, or a 2'-5' phosphodiester linkage, N3, NR 4 R 5 , NO2, 
NOR 6 , CN, -C(0)NH 2 , SH, -S-alkyl, -S-aryl, Se-alkyl, or Se-aryl, wherein at least 
5 one of R 2 or R 3 is hydroxyl; 

X 1 and X 2 can be electron-withdrawing groups; 
W can be O, S, Se, or NR 10 ; 

wherein R 4 , R 5 , R 6 , R 8 , R 9 , and R 10 can be, independently, alkyl, aryl, aralkyl, 
alkaryl, acyl, or hydrogen; and 
10 Z can be nitrogen or CR 7 , wherein R 7 can be hydrogen, hydroxyl, alkoxy, halo, 

alkyl, acyl, aryl, or aralkyl. 

The nucleosides having the formulas VI and VII can be readily prepared by reacting 
nucleosides having the formula I with a nucleophile. For example, a nucleoside having the 
formula I can be reacted with an alcohol (R 10 OH) or alkoxide (R 10 O~) to produce a 

15 nucleoside having the formula VI or VII. In one aspect, X and X in formulae VI and VII 
can be any of the electron-withdrawing groups described above. In one aspect, when the 
nucleoside is formula VI or VII, R 2 is hydroxyl and R 3 can be hydroxyl, halo, or alkoxy. In 
another aspect, when the nucleoside has the formula VI or VII, X or X 2 can be halo such as 
fluoro, chloro, bromo, or iodo. In a further aspect, when the nucleoside is VI or VII, Z is 

20 CH. The nuclecoside having the formula VI or VII can have any combination of these 
aspects. 

Disclosed are cytidine deaminase inhibitors that can comprise a nucleoside as 
described above, incorporated in a polymeric substrate, wherein the polymeric substrate 
targets a cytidine deaminase. The polymeric substrate can comprise an oligonucleotide or a 

25 polynucleotide, and can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, ,24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 
46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 
70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99, or 100 or more residues in length. Particularly, the nucleic acid 

30 residues can comprise up to about 1 8 residues. 

The nucleic acids of the cytidine deaminase inhibitor can be any length, particularly 
from about 2 to about 1000 residues, or any amount in between, from about 5 to about 500 
residues, from about 10 to about 100 residues, or from about 15 to about 50 residues, or in 
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particular about 18-20 residues. Large oligos can be made from smaller pieces joined 

together by 5'-I to 3'-phosphorothioate ligation as described by Y.Z. Xu and E.T. Kool, 

Tetrahedron Lett., 1997, 38, 5595-5598, herein incorporated in its entirety by reference. 

The nucleic acids of the cytidine deaminase inhibitor can comprise RNA residues 

5 from a single stranded RNA, a double stranded RNA, or a single stranded replication 

bubble. Alternatively, the nucleic acids also comprise DNA residues. The DNA residues can 

form a single stranded DNA, a double stranded DNA, or a single stranded replication 

bubble. Furthermore, the nucleic acids of the cytidine deaminase inhibitor can comprise 

stretches of RNA and DNA alternating on the same strand of a single stranded 

10 oligonucleotide, or on double strands. Furthermore, in a double stranded oligonucleotide, 

one strand can comprise RNA, while the other comprises DNA. 

Optimally, the targeted cytidine deaminase can be APOBEC-1, which is encoded by 

the following nucleic acid sequence: 

gatcccagag gaggaagtcc agagacagag caccatgact tctgagaaag gagaagaatc 
15 gaaccctggg agtttgacgt cttctatgac cccagagaac ttcgtaaaga ggcctgtctg 

ctctacgaaa tcaagtgggg catgagccgg aagatctggc gaagctcagg caaaaacacc 
accaatcacg tggaagttaa ttttataaaa aaatttacgt cagaaagaga ttttcaccca 
tccatcagct gctccatcac ctggttcttg tcctggagtc cctgctggga atgctcccag 
gctattagag agtttctgag tcggcaccct ggtgtgactc tagtgatcta cgtagctcgg 
20 cttttttggc acatggatca acaaaatcgg caaggtctca gggaccttgt taacagtgga 

gtaactattc agattatgag agcatcagag tattatcact gctggaggaa ttttgtcaac 
tacccacctg gggatgaagc tcactggcca caatacccac ctctgtggat gatgttgtac 
gcactggagc tgcactgcat aattctaagt cttccaccct gtttaaagat ttcaagaaga 
tggcaaaatc atcttacatt tttcagactt catcttcaaa actgccatta ccaaacgatt 
25 ccgccacaca tccttttagc tacagggctg atacatcctt ctgtggcttg gagatgaata 

ggatgattcc gtgtgtgtac tgattcaaga acaagcaatg atgacccact aaagagtgaa 
tgccatttag aatctagaaa tgttcacaag gtaccccaaa actctgtagc ttaaaccaac 
aataaatatg tattacctct ggc (SEQ ID NO: 2) 

30 The targeted cytidine deaminase of the cytidine deaminase inhibitor can also be an 

APOBEC-related protein, such as AID. The full length nucleic acid sequence that encodes 

r 

AID is: 

atggacagcc tcttgatgaa ccggaggaag tttctttacc aattcaaaaa tgtccgctgg gctaagggtc 
ggcgtgagac ctacctgtgc tacgtagtga agaggcgtga cagtgctaca tccttttcac tggactttgg 

35 ttatcttcgc aataagaacg gctgccacgt ggaattgctc ttcctccgct acatctcgga ctgggaccta 

gaccctggcc gctgctaccg cgtcacctgg ttcacctcct ggagcccctg ctacgactgt gcccgacatg 
tggccgactt tctgcgaggg aaccccaacc tcagtctgag gatcttcacc gcgcgcctct acttctgtga 
ggaccgcaag gctgagcccg aggggctgcg gcggctgcac cgcgccgggg tgcaaatagc catcatgacc 
ttcaaagatt atttttactg ctggaatact tttgtagaaa accatgaaag aactttcaaa gcctgggaag 

40 ggctgcatga aaattcagtt cgtctctcca gacagcttcg gcgcatcctt ttgcccctgt atgaggttga 

tgacttacga gacgcatttc gtactttggg actttga (SEQ ID NO: 6) 

35 
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The cytidine deaminase inhibitor for AID can comprise WRZY (SEQ ID NO: 7), 
wherein W is A or T, wherein R is a purine, wherein Z is the nucleoside, and wherein Y is a 
pyrimidine. 

5 The cytidine deaminase inhibitor can also be directed towards CEM15. The full 

length nucleic acid that encodes CEM15 (APOBEC-3G) is represented by: 

ctgccagggggagggccccagagaaaaccagaaagagggtgagagactgaggaagataaagcgtcccagg 

gcctcctacaccagcgcctgagcaggaagcgggaggggccatgactacgaggccctgggaggtcacttta 

gggagggctgtcctaaaaccagaagcttggagcagaaagtgaaaccctggtgctccagacaaagatctta 

10 gtcgggactagccggccaaggatgaagcctcacttcagaaacacagtggagcgaatgtatcgagacacat 
tctcctacaacttttataatagacccatcctttctcgtcggaataccgtctggctgtgctacgaagtgaa 
aacaaagggtccctcaaggccccctttggacgcaaagatctttcgaggccaggtgtattccgaacttaag 
taccacccagagatgagattcttccactggttcagcaagtggaggaagctgcatcgtgaccaggagtatg 
aggtcacctggtacatatcctggagcccctgcacaaagtgtacaagggatatggccacgttcctggccga 

15 ggacccgaaggttaccctgaccatcttcgttgcccgcctctactacttctgggacccagattaccaggag 
gcgcttcgcagcctgtgtcagaaaagagacggtccgcgtgccaccatgaagatcatgaattatgacgaat 
ttcagcactgttggagcaagttcgtgtacagccaaagagagctatttgagccttggaataatctgcctaa 
atattatatattactgcacatcatgctgggggagattctcagacactcgatggatccacccacattcact 
ttcaactttaacaatgaaccttgggtcagaggacggcatgagacttacctgtgttatgaggtggagcgca 

20 tgcacaatgacacctgggtcctgctgaaccagcgcaggggctttctatgcaaccaggctccacataaaca 
cggtttccttgaaggccgccatgcagagctgtgcttcctggacgtgattcccttttggaagctggacctg 
gaccaggactacagggttacctgcttcacctcctggagcccctgcttcagctgtgcccaggaaatggcta 
aattcatttcaaaaaacaaacacgtgagcctgtgcatcttcactgcccgcatctatgatgatcaaggaag 
atgtcaggaggggctgcgcaccctggccgaggctggggccaaaatttcaataatgacatacagtgaattt 

25 aagcactgctgggacacctttgtggaccaccagggatgtcccttccagccctgggatggactagatgagc 
acagccaagacctgagtgggaggctgcgggccattctccagaatcaggaaaactgaaggatgggcctcag 
tctctaaggaaggcagagacctgggttgagcctcagaataaaagatcttcttccaagaaatgcaaacagg 
ctgttcaccaccatctccagctgatcacagacaccagcaaagcaatgcactcctgaccaagtagattctt 
ttaaaaattagagtgcattactttgaatcaaaaatttatttatatttcaagaataaagtactaagattgt 

30 gctcaatacacagaaaagtttcaaacctactaatccagcgacaatttgaatcggttttgtaggtagagga 
ataaaatgaaatactaaatctttctgtaaaaaaaaaa (SEQ IDNO: 10) 

Representative sequences recognized by CEM15 include first strand DNA or 

portions thereof, including ten or more sequences comprising those produced by reverse 

35 transcription from HIV-1 : 

ggtctctctggttagaccagatttgagcctgggagctctctggctaactagggaacccactgcttaagcc 
tcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagaga 
tccctcagacccttttagtcagtgtggaaaatctctagcagtggcgcccgaacagggacttgaaagcgaa 
agggaaaccagaggagctctctcgacgcaggactcggcttgctgaagcgcgcacggcaagaggcgagggg 

40 aggcgactggtgagtacgccaaaaattttgactagcggaggctagaaggagagagatgggtgcgagagcg 
tcagtattaagcgggggagaattagatcgatgggaaaaaattcggttaaggccagggggaaagaaaaaat 
ataaattaaaacatatagtatgggcaagcagggagctagaacgattcgcagttaatcctggcctgttaga 
aacatcagaaggctgtagacaaatactgggacagctacaaccatcccttcagacaggatcagaagaactt 
agatcattatataatacagtagcaaccctctattgtgtgcatcaaaggatagagataaaagacaccaagg 

45 aagctttagacaagatagaggaagagcaaaacaaaagtaagaaaaaagcacagcaagcagcagctgacac 
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aggacacagcagccaggtcagccaaaattaccctatagtgcagaacatccaggggcaaatggtacatcag 

gccatatcacctagaactttaaatgcatgggtaaaagtagtagaagagaaggctttcagcccagaagtga 

tacccatgttttcagcattatcagaaggagccaccccacaagatttaaacaccatgctaaacacagtggg 

gggacatcaagcagccatgcaaatgttaaaagagaccatcaatgaggaagctgcagaatgggatagagtg 

catccagtgcatgcagggcctattgcaccaggccagatgagagaaccaaggggaagtgacatagcaggaa 

ctactagtacccttcaggaacaaataggatggatgacaaataatccacctatcccagtaggagaaattta 

taaaagatggataatcctgggattaaataaaatagtaagaatgtatagccctaccagcattctggacata 

agacaaggaccaaaagaaccctttagagactatgtagaccggttctataaaactctaagagccgagcaag 

cttcacaggaggtaaaaaattggatgacagaaaccttgttggtccaaaatgcgaacccagattgtaagac 

tattttaaaagcattgggaccagcagctacactagaagaaatgatgacagcatgtcagggagtgggagga 

cccggccataaggcaagagttttggctgaagcaatgagccaagtaacaaattcagctaccataatgatgc 

aaagaggcaattttaggaaccaaagaaagattgttaagtgtttcaattgtggcaaagaagggcacatagc 

cagaaattgcagggcccctaggaaaaagggctgttggaaatgtggaaaggaaggacaccaaatgaaagat 

tgtactgagagacaggctaattttttagggaagatctggccttcctacaagggaaggccagggaattttc 

ttcagagcagaccagagccaacagccccaccatttcttcagagcagaccagagccaacagccccaccaga 

agagagcttcaggtctggggtagagacaacaactccctctcagaagcaggagccgatagacaaggaactg 

tatcctttaacttccctcagatcactctttggcaacgacccctcgtcacaataaagataggggggcaact 

aaaggaagctctattagatacaggagcagatgatacagtattagaagaaatgagtttgccaggaagatgg 

aaaccaaaaatgatagggggaattggaggttttatcaaagtaagacagtatgatcagatactcatagaaa 

tctgtggacataaagctataggtacagtattagtaggacctacacctgtcaacataattggaagaaatct 

gttgactcagattggttgcactttaaattttcccattagtcctattgaaactgtaccagtaaaattaaag 

ccaggaatggatggcccaaaagttaaacaatggccattgacagaagaaaaaataaaagcattagtagaaa 

tttgtacagaaatggaaaaggaagggaaaatttcaaaaattgggcctgaaaatccatacaatactccagt 

atttgccataaagaaaaaagacagtactaaatggagaaaattagtagatttcagagaacttaataagaga 

actcaagacttctgggaagttcaattaggaataccacatcccgcagggttaaaaaagaaaaaatcagtaa 

cagtactggatgtgggtgatgcatatttttcagttcccttagatgaagacttcaggaagtatactgcatt 

taccatacctagtataaacaatgagacaccagggattagatatcagtacaatgtgcttccacagggatgg 

aaaggatcaccagcaatattccaaagtagcatgacaaaaatcttagagccttttagaaaacaaaatccag 

acatagttatctatcaatacatggatgatttgtatgtaggatctgacttagaaatagggcagcatagaac 

aaaaatagaggagctgagacaacatctgttgaggtggggacttaccacaccagacaaaaaacatcagaaa 

gaacctccattcctttggatgggttatgaactccatcctgataaatggacagtacagcctatagtgctgc 

cagaaaaagacagctggactgtcaatgacatacagaagttagtgggaaaattgaattgggcaagtcagat 

ttacccagggattaaagtaaggcaattatgtaaactccttagaggaaccaaagcactaacagaagtaata 

ccactaacagaagaagcagagctagaactggcagaaaacagagagattctaaaagaaccagtacatggag 

tgtattatgacccatcaaaagacttaatagcagaaatacagaagcaggggcaaggccaatggacatatca 

aatttatcaagagccatttaaaaatctgaaaacaggaaaatatgcaagaacgaggggtgcccacactaat 

gatgtaaaacaattaacagaggcagtgcaaaaaataaccacagaaagcatagtaatatggggaaagactc 

ctaaatttaaactacccatacaaaaggaaacatgggaaacatggtggacagagtattggcaagccacctg 

gattcctgagtgggagtttgtcaatacccctcctttagtgaaattatggtaccagttagagaaagaaccc 

atagtaggagcagaaacgttctatgtagatggggcagctagcagggagactaaattaggaaaagcaggat 

atgttactaatagaggaagacaaaaagttgtcaccctaactgacacaacaaatcagaagactgagttaca 

agcaattcatctagctttgcaggattcgggattagaagtaaatatagtaacagactcacaatatgcatta 

ggaatcattcaagcacaaccagataaaagtgaatcagagttagtcaatcaaataatagagcagttaataa 

aaaaggaaaaggtctatctggcatgggtaccagcacacaaaggaattggaggaaatgaacaagtagataa 

attagtcagtgctggaatcaggaaagtactatttttagatggaatagataaggcccaagatgaacatgag 

aaatatcacagtaattggagagcaatggctagtgattttaacctgccacctgtagtagcaaaagaaatag 

tagccagctgtgataaatgtcagctaaaaggagaagccatgcatggacaagtagactgtagtccaggaat 

atggcaactagattgtacacatttagaaggaaaagttatcctggtagcagttcatgtagccagtggatat 

atagaagcagaagttattccagcagaaacagggcaggaaacagcatactttcttttaaaattagcaggaa 
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gatggccagtaaaaacaatacatacagacaatggcagcaatttcaccagtactacggttaaggccgcctg 

ttggtgggcgggaatcaagcaggaatttggaattccctacaatccccaaagtcaaggagtagtagaatct 

atgaataaagaattaaagaaaattataggccaggtaagagatcaggctgaacatcttaagacagcagtac 

aaatggcagtattcatccacaattttaaaagaaaaggggggattggggggtacagtgcaggggaaagaat 

agtagacataatagcaacagacatacaaactaaagaattacaaaaacaaattacaaaaattcaaaatttt 

cgggtttattacagggacagcagagatccactttggaaaggaccagcaaagctcctctggaaaggtgaag 

gggcagtagtaatacaagataatagtgacataaaagtagtgccaagaagaaaagcaaagatcattaggga 

ttatggaaaacagatggcaggtgatgattgtgtggcaagtagacaggatgaggattagaacatggaaaag 

tttagtaaaacaccatatgtatgtttcagggaaagctaggggatggttttatagacatcactatgaaagc 

cctcatccaagaataagttcagaagtacacatcccactaggggatgctagattggtaataacaacatatt 

ggggtctgcatacaggagaaagagactggcatctgggtcagggagtctccatagaatggaggaaaaagag 

atatagcacacaagtagaccctgaactagcagaccaactaattcatctgtattactttgactgtttttca 

gactctgctataagaaaggccttattaggacatatagttagccctaggtgtgaatatcaagcaggacata 

acaaggtaggatctctacaatacttggcactagcagcattaataacaccaaaaaagataaagccaccttt 

gcctagtgttacgaaactgacagaggatagatggaacaagccccagaagaccaagggccacagagggagc 

cacacaatgaatggacactagagcttttagaggagcttaagaatgaagctgttagacattttcctaggat 

ttggctccatggcttagggcaacatatctatgaaacttatggggatacttgggcaggagtggaagccata 

ataagaattctgcaacaactgctgtttatccatttcagaattgggtgtcgacatagcagaataggcgtta 

ctcaacagaggagagcaagaaatggagccagtagatcctagactagagccctggaagcatccaggaagtc 

agcctaaaactgcttgtaccacttgctattgtaaaaagtgttgctttcattgccaagtttgtttcacaac 

aaaagccttaggcatctcctatggcaggaagaagcggagacagcgacgaagacctcctcaaggcagtcag 

actcatcaagtttctctatcaaagcagtaagtagtacatgtaatgcaacctatacaaatagcaatagcag 

cattagtagtagcaataataatagcaatagttgtgtggtccatagtaatcatagaatataggaaaatatt 

aagacaaagaaaaatagacaggttaattgatagactaatagaaagagcagaagacagtggcaatgagagt 

gaaggagaaatatcagcacttgtggagatgggggtggaaatggggcaccatgctccttgggatattgatg 

atctgtagtgctacagaaaaattgtgggtcacagtctattatggggtacctgtgtggaaggaagcaacca 

ccactctattttgtgcatcagatgctaaagcatatgatacagaggtacataatgtttgggccacacatgc 

ctgtgtacccacagaccccaacccacaagaagtagtattggtaaatgtgacagaaaattttaacatgtgg 

aaaaatgacatggtagaacagatgcatgaggatataatcagtttatgggatcaaagcctaaagccatgtg 

taaaattaaccccactctgtgttagtttaaagtgcactgatttggggaatgctactaataccaatagtag 

taataccaatagtagtagcggggaaatgatgatggagaaaggagagataaaaaactgctctttcaatatc 

agcacaagcataagaggtaaggtgcagaaagaatatgcatttttttataaacttgatataataccaatag 

ataatgatactaccagctatacgttgacaagttgtaacacctcagtcattacacaggcctgtccaaaggt 

atcctttgagccaattcccatacattattgtgccccggctggttttgcgattctaaaatgtaataataag 

acgttcaatggaacaggaccatgtacaaatgtcagcacagtacaatgtacacatggaattaggccagtag 

tatcaactcaactgctgttgaatggcagtctagcagaagaagaggtagtaattagatctgccaatttcac 

agacaatgctaaaaccataatagtacagctgaaccaatctgtagaaattaattgtacaagacccaacaac 

aatacaagaaaaagtatccgtatccagaggggaccagggagagcatttgttacaataggaaaaataggaa 

atatgagacaagcacattgtaacattagtagagcaaaatggaatgccactttaaaacagatagctagcaa 

attaagagaacaatttggaaataataaaacaataatctttaagcaatcctcaggaggggacccagaaatt 

gtaacgcacagttttaattgtggaggggaatttttctactgtaattcaacacaactgtttaatagtactt 

ggtttaatagtacttggagtactgaagggtcaaataacactgaaggaagtgacacaatcacactcccatg 

cagaataaaacaatttataaacatgtggcaggaagtaggaaaagcaatgtatgcccctcccatcagcgga 

caaattagatgttcatcaaatattacagggctgctattaacaagagatggtggtaataacaacaatgggt 

ccgagatcttcagacctggaggaggagatatgagggacaattggagaagtgaattatataaatataaagt 

agtaaaaattgaaccattaggagtagcacccaccaaggcaaagagaagagtggtgcagagagaaaaaaga 

gcagtgggaataggagctttgttccttgggttcttgggagcagcaggaagcactatgggcgcacggtcaa 

tgacgctgacggtacaggccagacaattattgtctggtatagtgcagcagcagaacaatttgctgagggc 

tattgaggcgcaacagcatctgttgcaactcacagtctggggcatcaagcagctccaggcaagaatcctg 
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gctgtggaaagatacctaaaggatcaacagctcctggggatttggggttgctctggaaaactcatttgca 
ccactgctgtgccttggaatgctagttggagtaataaatctctggaacagatttggaataacatgacctg 
gatggagtgggacagagaaattaacaattacacaagcttaatacattccttaattgaagaatcgcaaaac 
cagcaagaaaagaatgaacaagaattattggaattagataaatgggcaagtttgtggaattggtttaaca 
5 taacaaattggctgtggtatataaaaatattcataatgatagtaggaggcttggtaggtttaagaatagt 
ttttgctgtactttctatagtgaatagagttaggcagggatattcaccattatcgtttcagacccacctc 
ccaaccccgaggggacccgacaggcccgaaggaatagaagaagaaggtggagagagagacagagacagat 
ccattcgattagtgaacggatccttagcacttatctgggacgatctgcggagcctgtgcctcttcagcta 
ccaccgcttgagagacttactcttgattgtaacgaggattgtggaacttctgggacgcagggggtgggaa 

10 gccctcaaatattggtggaatctcctacagtattggagtcaggaactaaagaatagtgctgttagcttgc 

tcaatgccacagccatagcagtagctgaggggacagatagggttatagaagtagtacaaggagcttgtag 
agctattcgccacatacctagaagaataagacagggcttggaaaggattttgctataagatgggtggcaa 
gtggtcaaaaagtagtgtggttggatggcctactgtaagggaaagaatgagacgagctgagccagcagca 
gatggggtgggagcagcatctcgagacctggaaaaacatggagcaatcacaagtagcaatacagcagcta 

15 ccaatgctgcttgtgcctggctagaagcacaagaggaggaggaggtgggttttccagtcacacctcaggt 
acctttaagaccaatgacttacaaggcagctgtagatcttagccactttttaaaagaaaaggggggactg 
gaagggctaattcactcccaacgaagacaagatatccttgatctgtggatctaccacacacaaggctact 
tccctgattggcagaactacacaccagggccaggggtcagatatccactgacctttggatggtgctacaa 
gctagtaccagttgagccagataaggtagaagaggccaataaaggagagaacaccagcttgttacaccct 

20 gtgagcctgcatggaatggatgaccctgagagagaagtgttagagtggaggtttgacagccgcctagcat 
ttcatcacgtggcccgagagctgcatccggagtacttcaagaactgctgacatcgagcttgctacaaggg 
actttccgctggggactttccagggaggcgtggcctgggcgggactggggagtggcgagccctcagatgc 
tgcatataagcagctgctttttgcctgtactgggtctctctggttagaccagatttgagcctgggagctc 
tctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttca (SEQ ID NO: 4) 

25 

Other targets for CEM-15 can include the RNA and first strand DNA from: simian 
immunodeficiency virus (SIV), equine infectious anemia virus (EIAV), and murine 
leukemia virus, as well as other human Antiviruses, and retroviruses such as T-cell 
leukemia virus, spleen necrosis virus, and the pararetorvirus hepatitis B virus. 

30 

Mooring sequences 

Mooring sequences are the nucleic acid sequences recognized by cytidine 
deaminases. These sequences act as the recognition site for the enzyme, which can then 
deaminate a cytidine that is within a given length of the mooring sequence. 

35 In APOBEC-1, RNA secondary structure does not appear to be required for apoB 

RNA editing. Instead, apoB mRNA editing requires an 1 1 nucleotide motif known as the 
mooring sequence. Placement of the mooring sequence 4-8 nucleotides 3' of a cytidine 
within reporter RNAs is frequently sufficient for that RNA to support editing (Smith (1993) 
Seminars in Cell Biol 4, 267-78; Sowden (1998) Nucl Acids Res. 26, 1644-1652; Backus 

40 and Smith (1992) Nucl Acids Res. 22, 6007-14; Backus and Smith (1991) Nucl Acids Res. 
19, 6781-86; Backus and Smith (1994) Biochim. Biophys. Acta 1217, 65-73; Backus and 
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Smith (1994) Biochim. Biophys. Acta 1219, 1-14; Sowden et al., (1996) RNA 2, 274-88). 
The mooring sequence is left intact in edited mRNA and therefore its occurrence 
downstream of a cytidine is predictive of an editing site. 

Alone APOBEC-1 exhibits only nominal affinity for AU-rich sequences within the 
region of the apoB mRNA mooring sequence (Anant and Davidson (2000) Mol. Cell Biol. 
20, 1982-1992, incorporated by reference in its entirety). The consensus sequence for 
APOBEC-1 binding is: 5'UUUN[A/U]U-3' (SEQ ID NO: 1 1) The mooring sequence 
described by Smith (1993) Seminars in Cell Biology 4, 267-278), herein incorporated by 
reference) is: 

5'-C*AAuu[uGauCAGUAUA]uu-3' (SEQ ID NO: 12) 

(where C* is the cytidine to be edited) and is highly conserved among 32 mammalian 
sequences (Hersberger et al & Innerarity, (1999) J. Biol Chem. 274, 34590-34597, herein 
incorporated by reference in its entirety for teachings related to mooring sequences). 
Regions in lower case are specific "high affinity" sequences for APOBEC-1 binding. 
Regions in brackets define the "mooring sequence." A cytidine deaminase inhibitor can 
utilize both the upstream sequence at the edited C (denoted by * and would be substituted 
with a zebularine or other cytidine deaminase inhibitor nucleotide) as well as downstream 
regions including the mooring sequence. The minimal sequence size capable of supporting 
editing (and hence and effective cytidine deaminase inhibitor of APOBEC-1 is >_about a 
26-mer) having a nucleic acid sequence: 

5'-GAUA(Z*)AAUU[UGAUCAGUAUA]U-3' (SEQ ID NO: 13). 

Auxiliary proteins 

Disclosed are cytidine deaminase inhibitors comprising a nucleoside incorporated in 
a polymeric substrate, wherein the polymeric substrate targets an auxiliary protein of a 
cytidine deaminase. An auxiliary protein of a cytidine deaminase is a protein that assists the 
cytidine deaminase in nucleic acid recognition, and specifically in recognized editing sites. 

One example of an auxiliary protein of a cytidine deaminase is APOBEC-1 
Complementing Factor (ACF). The nucleotide sequence recognized by ACF is the mooring 
sequence RNA disclosed above (SEQ ID NO: 13). 



40 



WO 2005/115410 



PCT/US2005/016001 



Another example of an auxiliary protein of a cytidine deaminase is APOBEC-1 
Stimulating Protein (ASP). ASP recognizes the mooring sequence and is a spliced variant of 
ACF (Dance et al., 2002) J. Biol Chem. 277, 12703-09). Additional ACF spliced variants 
are ACF45 and ACF43 that also bind to the mooring sequence (Sowden et al., (2004) J. 
5 Biol Chem. 279, 197-206) 

Other mRNA sequences targeted by APOBEC-1 which can be inhibited by cytidine 
deaminase inhibitor include, but are not limited, to NF1 (neurofibromin 1), whose targeted 
sequence is 

5'-CCUUUAUUA(Z*)GAAUUGUGAUCACAUCCUCUG-3' (SEQ ID NO: 14), 

10 wherein "Z" is the nucleoside residue. 

Unregulated transgenic expression of APOBEC-1 confers susceptibility to liver cancer 
(Sowden, M., (1996) J Biol Chem, 271,:301 1-7; Yamanaka, S., et al., (1995) PNAS USA, 
92, 8483-7, herein incorporated by reference in their entireties,), possibly as a result of 
nonspecific mRNA editing affecting the NAT1 translational repressor. The following 
15 human NAT-1 (novel APOBEC-1 target) sequences are listed with each "mooring 
sequence" underlined. 

5'-aagcaagttztttgatcagtttactcaaaca-3' (SEQ ID NO: 15) 
20 5 '-aaatgctgczagaaa ttgatcagaata agga-3 ' (SEQ ID NO: 16) 
5'-ttacttgcazcaaactgatcagttttgagag-3' (SEQ ID NO: 17) 

Yamanaka et al. looked at several proteins and reported editing of protein tyrosine kinase 
25 (TEC). Only the latter was edited 

Mouse TEC sequence 

5'-aggtacgttztggatgatcagtacacaagttc-3' (SEQ ID NO: 18) 

30 Corresponding HUMAN TEC Sequence 

5'-aggtatgttztggatgatcagtacacaagttc-3' (SEQ ID NO: 19) 

APOBEC-1 relies on auxiliary proteins for RNA recognition (Grosjean and Benne 
(1998); Teng (1993) Science 260:1816-19; Sowden (1998) Nucl. Acids Res. 26:1644-52; 
35 Inui (1994) J. Lipid Res. 35:1477-89; Dance (2001) Nucl. Acids Res. 29:1772-80). 

APOBEC-1 only has weak RNA binding activity of low specificity (Anant (1995) J. Biol 
Chem 270, 14768-75; MacGinnitie (1995) J. Biol Chem, 270, 14768-75). To edit apoB 
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mRNA, APOBEC-1 requires, in addition to the mooring sequence described above, RNA 
binding proteins that bind apoB mRNA and to which APOBEC-1 can bind and orient itself 
to C6666. The components of the minimal editosome from defined in vitro system analyses 
are APOBEC-1 as a homodimeric cytidine deaminase (Lau, P.P., (1994) PNAS U.S.A. 91, 
5 8522-8526) bound to the auxiliary protein ACF/ASP that serves as the editing-site 

recognition factor through its mooring-sequence-selective RNA-binding activity (Mehta, 
A., (2000) Mol. Cell. Biol. 20, 1846-1854; Lellek, H., (2000) J. Biol. Chem. 275, 19848- 
19856). Several other auxiliary protein candidates have also been described that had binding 
affinities for APOBEC-1 and/or apoB mRNA and that demonstrated the ability to modulate 

10 editing efficiency (Giannoni, F., et al., (1994) J. Biol. Chem. 269, 5932-5936;Yamanaka, S., 
et al., (1994) J. Biol. Chem. 269,21725-21734; Yang, Y., et a;., (1997) J. Biol. Chem. 272, 
27700-27706; Lellek, H., et al., (2000) J. Biol Chem. 275, 19848-19856; Teng, B., et al., 
(1993) Science 260, 1816-1819; Inui, Y., et al., (1994) J. Lipid Res. 35, 1477-1489; Anant, 
S.G., et al., (1997) Nucleic Acids Symp. Ser. 36, 1 1 5-118; Lau, P.P., et al., (1997) J. Biol. 

15 Chem. 272, 1452-1455). 

ACF was isolated and cloned using biochemical fractionation and yeast two hybrid 
genetic selection (Mehta et al., (2000) Mol Cell Biol. 20, 1846-54; Lellek (2000) JBC 
275:19848-56). Overexpression of 6His-tagged APOBEC-1 in mammalian cells enabled the 
intracellular^ assembled editosome to be affinity purified (Yang Y. et al., (1997) J. Biol 

20 Chem. 272, 27700-06). ACF associated with APOBEC-1 through 1M NaCl resistant 

interactions and that three other RNA binding proteins (100 kDa, 55 kDa and 44 kDa) with 
affinity for the mooring sequence co-purified with the editosome (Yang, Y. et al., (1997) J. 
Biol Chem. 272, 27700-06). PI 00 and p55 were both mooring-sequence-selective RNA 
binding proteins but p44 was a general RNA binding protein. Additional studies utilizing 

25 yeast two hybrid analyses using APOBEC-1 affinity and antibodies developed against the 
editosome and ACF have demonstrated proteins such as hnRNP ABBP1 (Lau, P. et al., 
(1997) J. Biol Chem 272, 1452-55), the alternative splicing factor KSRP (Lellek et al., 
(2000) J. Biol Chem 275, 19848-56) and aI3 serum proteinase inhibitor as positive 
modulators of editing activity (Schock, D. et al., (1996) PNAS USA 93, 1097-1 102) and 

30 hnRNP protein C (Greeve, J. et al.,(1998) Biol. Chem. 379, 1063-73) and GRY-RBP 
(Blanc, V. et al., (2001) J. Biol Chem, 276, 10272-83; Lau, P. et al., (2001) Biochem. 
Biophys. Res. Commun. 282, 977-83) as negative modulators of apoB mRNA editing. 
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Disclosed are compositions comprising a cytidine deaminase inhibitor, such as those 
described above, and a pharmaceutical carrier. 

The compositions can be administered in vivo in a pharmaceutical^ acceptable 
carrier. By "pharmaceutical^ acceptable" is meant a material that is not biologically or 
5 otherwise undesirable, i.e., the material may be administered to a subject, along with a 

nucleic acid or vector, without causing any undesirable biological effects or interacting in a 
deleterious manner with any of the other components of the pharmaceutical composition in 
which it is contained. The carrier would naturally be selected to minimize any degradation 
of the active ingredient and to minimize any adverse side effects in the subject, as would be 
10 well known to one of skill in the art. 
Delivery 

The disclosed compositions can be delivered to the target cells in a variety of ways. 

For example, the compositions can be delivered through electroporation, or through 

lipofection, or through calcium phosphate precipitation. The delivery mechanism chosen 
15 will depend in part on the type of cell targeted and whether the delivery is occurring for 

example in vivo or in vitro. 

Thus, the compositions can comprise, for example, lipids such as liposomes, such as 

cationic liposomes (e.g., DOTMA, DOPE, DC-cholesterol) or anionic liposomes. 

Liposomes can further comprise proteins to facilitate targeting a particular cell, if desired. 
20 Administration of a composition comprising a compound and a cationic liposome can be 

administered to the blood afferent to a target organ or inhaled into the respiratory tract to 

target cells of the respiratory tract. Regarding liposomes, see, e.g., Brigham et al. (1989) 

Am. X Resp. Cell Mol Biol. 1, 95-100); Feigner et al. (1987) PNAS USA 84, 7413-7417); 

U.S. Pat. No.4,897,355. Furthermore, the compound can be administered as a component 
25 of a microcapsule that can be targeted to specific cell types, such as macrophages, or where 

the diffusion of the compound or delivery of the compound from the microcapsule is 

designed for a specific rate or dosage. 

Disclosed are methods of treating a subject with cancer, or having a predisposition 

to cancer. Such cancers include, but are not limited to, those associated with AID, like B 
30 cell malignancies such as diffuse large B cell lymphomas and chronic lymphatic leukemia. 

The disorder can also be a T cell lymphoma or a pulmonary adenoma. Also included are 

those cancers associated with APOBEC-1, like colorectal cancer and neurofibromatosis. 

Also included are those cancers associated with CEM15, such as breast cancer. Cancers 
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involving overexpression of other ARPs of listed in Wedekind et aL (2003) Trends in 
Genetics, herein incorporated by reference in it's entirety. 

As used throughout, administration of a cytidine deaminase inhibitor can occur in 
conjunction with other therapeutic agents. Thus, the cytidine deaminase inhibitors of the 
5 present invention can be administered alone or in combination with one or more therapeutic 
agents. For example, a subject can be treated with a cytidine deaminase inhibitor alone, or 
in combination with chemotherapeutic agents, antibodies, antivirals, steroidal and non- 
steroidal anti-inflammatories, conventional immunotherapeutic agents, cytokines, 
chemokines, and/or growth factors. Combinations may be administered either 
10 concomitantly (e.g., as an admixture), separately but simultaneously (e.g., via separate 
intravenous lines into the same subject), or sequentially (e.g., one of the compounds or 
agents is given first followed by the second). Thus, the term "combination" or "combined" 
is used to refer to either concomitant, simultaneous, or sequential administration of two or 
more agents. 

15 A cytidine deaminase inhibitor is an agent that inhibits editing of nucleic acids by 

cytidine deaminases. Additional genetic, pharmacologic, or metabolic agents or conditions 
also modulate the RNA or DNA editing or mutating function of the deaminase. Some of 
the conditions that modulate editing activity include: (i) changes in the diet, (ii) hormonal 
changes (e.g., levels of insulin or thyroid hormone), (Hi) osmolality (e.g., hyper or hypo 

20 osmolality), (iv) ethanol, (v) inhibitors of RNA or protein synthesis and (yi) conditions that 
promote liver proliferation. Thus, the methods of the invention can comprise administering 
a cytidine deaminase inhibitor to the subject and using other conditions that reduce the 
efficiency of mRNA editing function or the efficacy of the cytidine deaminase. 

Disclosed are methods of treating a condition, wherein the condition is a cancer. 

25 The cancer can be selected from the group consisting of lymphomas (Hodgkins and non- 
Hodgkins), B cell lymphoma, T cell lymphoma, myeloid leukemia, leukemias, mycosis 
fungoides, carcinomas, carcinomas of solid tissues, squamous cell carcinomas, 
adenocarcinomas, sarcomas, gliomas, blastomas, neuroblastomas, plasmacytomas, 
histiocytomas, melanomas, adenomas, hypoxic tumours, myelomas, AIDS-related 

30 lymphomas or sarcomas, metastic cancers, bladder cancer, brain cancer, nervous system 
cancer, squamous cell carcinoma of head and neck, neuroblastoma/glioblastoma, ovarian 
cancer, skin cancer, liver cancer, melanoma, squamous cell carcinomas of the mouth, throat, 
larynx, and lung, colon cancer, cervical cancer, cervical carcinoma, breast cancer, epithelial 
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cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head 
and neck carcinoma, hematopoietic cancers, testicular cancer, colo-rectal cancers, prostatic 
cancer, or pancreatic cancer. 

Cytidine deaminase inhibitors are of benefit to individuals who are over-expressing 
5 cytidine deaminases, as well as those individuals who would benefit from a reduction in 
their activity. Because cytidine deaminase inhibitors reduce the activity of cytidine 
deaminases, the mutation rate of certain nucleic acids recognized by the cytidine deaminase 
is reduced, and any subject that can benefit from a reduction in the activity of cytidine 
deaminases can be administered cytidine deaminase inhibitor. This includes those receiving 

10 exogenous nucleic acids via a viral vector, in which a reduction in the recognition of these 
viral vectors would be beneficial. Therefore, the cytidine deaminase inhibitors can be 
administered in combination with viral vectors. For example, inactivation of AID can 
permit introduction and function of the viral vector delivered therapeutic model, such as 
RNA, DNA, siRNA, or protein, so these two technologies can be used in tandem. 

15 One skilled in the art will appreciate that the viral vector can comprise any viral 

vector useful in delivery of exogenous nucleic acids. For example, the viral vector can be a 
recombinant adenovirus vector, an adeno-associated viral vector, a lentiviral vector, a 
pseudotyped retroviral vector, a vaccinia vector, an alphavirus vector, or any other viral 
vector known in the art or described throughout. 

20 In particular, the viral vector can be a retrovirus. A retrovirus is an animal virus 

belonging to the virus family of Retro viridae, including any types, subfamilies, genus, or 
tropisms. Retroviral vectors, in general, are described by Verma, I.M., Retroviral vectors 
for gene transfer. In Microbiology- 1985, American Society for Microbiology, pp. 229-232, 
Washington, which is incorporated by reference herein in its entirety for the retroviral 

25 vectors taught and methods of making or using the same. Examples of methods for using 
retroviral vectors for exogenous nucleic acid delivery are described in U.S. Patent Nos. 
4,868,1 16 and 4,980,286; PCT applications WO 90/02806 and WO 89/07136; and 
Mulligan, (Science 260:926-932 (1993)); the teachings of which are incorporated herein by 
reference in their entirety for the methods taught herein. The retrovirus can be in the 

30 Oncovirinae subfamily of retroviruses, such as HTLV-I or HTLV-II (human T-cell 

leukemia virus type I and type II, respectively). Additionally, the retrovirus can be in the 
Lentivirinae subfamily of retroviruses, such as HIV-1, HIV-II, SIV, FIV, EIAV and CAEV 
(human immunodeficiency virus type I, human immunodeficiency virus type II, simian 
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immunodeficiency virus, feline immunodeficiency virus, equine infectious anemia virus, 
and caprine arthritis-encephalitis virus, respectfully). 

A retrovirus is essentially a package that has packed into it nucleic acid cargo. The 
nucleic acid cargo carries with it a packaging signal, which ensures that the replicated 
5 daughter molecules will be efficiently packaged within the package coat. In addition to the 
package signal, there are a number of molecules which are needed in cis, for the replication, 
and packaging of the replicated virus. Typically a retroviral genome contains the gag, pol, 
and env genes which are involved in the making of the protein coat. Retrovirus vectors 
typically contain a packaging signal for incorporation into the package coat, a sequence 

10 which signals the start of the gag transcription unit, elements necessary for reverse 
transcription, including a primer binding site to bind the tRNA primer of reverse 
transcription, terminal repeat sequences that guide the switch of RNA strands during DNA 
synthesis, a purine rich sequence 5 1 to the 3* LTR that serve as the priming site for the 
synthesis of the second strand of DNA synthesis, and specific sequences near the ends of 

15 the LTRs that enable the insertion of the DNA state of the retrovirus to insert into the host 
genome. The removal of the gag, pol, and env genes allows for about 8 kb of foreign 
sequence to be inserted into the viral genome, become reverse transcribed, and upon 
replication be packaged into a new retroviral particle. This amount of nucleic acid is 
sufficient for the delivery of a one to many genes depending on the size of each transcript. 

20 It is preferable to include either positive or negative selectable markers along with other 
genes in the insert. 

Since the replication machinery and packaging proteins in most retroviral vectors 
have been removed (gag, pol, and env), the vectors are typically generated by placing them 
into a packaging cell line. A packaging cell line is a cell line which has been transfected or 

25 transformed with a retrovirus that contains the replication and packaging machinery, but 

lacks any packaging signal. When the vector carrying the DNA of choice is transfected into 
these cell lines, the vector containing the gene of interest is replicated and packaged into 
new retroviral particles, by the machinery provided in cis by the helper cell. The genomes 
for the machinery are not packaged because they lack the necessary signals. 

30 Because cytidine deaminases are known to act upon viruses, inactivating these 

cytidine deaminases in the presence of viral vectors allows for tissue specific gene transfer 
and insertion for therapeutic purposes (i.e. retroviral DNA synthesis and genome 
incorporation). The infectivity of HIV- 1 is markedly reduced in T lymphocytes that 
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naturally express CEM15 (Sheehy et al., (2002) Nature, 418, 646-50). Recent research has 
demonstrated that CEM15 acts as a host cell defense mechanism by introducing numerous 
dC to dU and dG to dA mutations in the HIV-1 genome during it's replication, rendering the 
virus nonfunctional (Mangeat, B. et al., (2002) Nature 424, 99-103; Mariani, R. et al., 
5 (2003) Cell 1 14, 21-31; Zhang, H. et al., (2003) Nature 424, 94-8) . A critical step for this 
activity is for CEM15 to incorporate into viral particles during their assembly (Mariani, R. 
et al., (2003) Cell 114, 21-31; Stopak, K. et al., (2003) Mol Cell 12, 591-601) and then, 
subsequent to infection, mutate single stranded DNA during viral replication Mangeat, B. et 
al., (2002) Nature 424, 99-103; Zhang, H. et al., (2003) Nature 424, 94-8; Stopak, K. et al., 

10 (2003) Mol Cell 12, 591-601). This defense mechanism can be blocked by the HIV protein 
known has Vif (Sheehy et al., (2002) Nature, 418, 646-50) which reduces the expression of 
CEM15 (Stopak, K. et al., (2003) Mol Cell 12, 591-601) and binds to CEM15 (Mariani, R. 
et al., (2003) Cell 1 14, 21-31; Yang, B. et al., (2003) J. Biol Chem. 278, 6596-602), 
targeting it to the protein degradation pathway of the cell (Stopak, K. et al., (2003) Mol Cell 

15 12, 591-601; Liu, B. et al., (2004) J. Virol. 78, 2072-81; Sheehy, A.M., et al., (2003) Nat. 
Med. 9, 1404-7). Viral particles produced from Vif positive HIV contain Vif (Ohagen, A. 
and Gabuzda, D. (2000) J. Virol. 74, 1 1055-66; Zhang, H. et al., (2000) J. Virol. 74, 8252- 
61; Simon, J.H. et al., (1996) J. Virol. 70, 5297-305; Khan, M.A. et al., (2001) J. Virol. 75, 
7252-65) but contain low or no CEM15 (Mariani, R. et al., (2003) Cell 1 14, 21-31; Stopak, 

20 K. et al., (2003) Mol Cell 12, 591-601; Liu, B. et al., (2004) J. Virol. 78, 2072-81; Sheehy, 
A.M., et al., (2003) Nat. Med. 9, 1404-7) and consequently are infectious. These new 
understandings suggest that there are at least two times when CEM15 activity is critical in 
host defense from HIV infection. These are: early during infection when the viral RNA 
genome is replicated and late in infection when viral particles are being assembled. Both of 

25 these require CEM 15 localization within the cytoplasm of the host cell and this has been 
confirmed by immunolocalization and biochemical fractionation. 

The compositions comprising a cytidine deaminase inhibitor in a pharmaceutically 
acceptable carrier may be administered orally, parenterally (e.g., intravenously), by 
intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally, 

30 topically or the like, although topical intranasal administration or administration by inhalant 
is typically preferred. As used herein, "topical intranasal administration" means delivery of 
the compositions into the nose and nasal passages through one or both of the nares and can 
comprise delivery by a spraying mechanism or droplet mechanism, or through 
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aerosolization of the nucleic acid or vector. The latter may be effective when a large 
number of animals is to be treated simultaneously. Administration of the compositions by 
inhalant can be through the nose or mouth via delivery by a spraying or droplet mechanism. 
Delivery can also be directly to any area of the respiratory system (e.g., lungs) via 
5 intubation. The exact amount of the compositions required will vary from subject to 
subject, depending on the species, age, weight and general condition of the subject, the 
severity of the disorder being treated, the particular nucleic acid or vector used, its mode of 
administration and the like. Thus, it is not possible to specify an exact amount for every 
composition. However, an appropriate amount can be determined by one of ordinary skill 

10 in the art using only routine experimentation given the teachings herein. 

Cytidine deaminase inhibitor for APOBEC-1, used to reduce apoB mRNA editing in 
the small intestine, can be administered orally, for example. RNA is stable to stomach acids, 
and the ribonucleases can have modified backbones in one embodiment. As disclosed 
herein, peptide sequences can be used with the RNA molecule during its synthesis (for 

15 example, using the TAT motif for enhanced uptake). It is also envisioned that peptide- 
oligos contain information, not only for compound entry, but also for targeting by 
employing peptidyl nuclear or cytoplasmic nuclear localization signals, such as PNAs, can 
be used. 

In one aspect, the cytidine deaminase inhibitor molecule can be a PNA (peptide- 
20 RNA), for example. This molecule can serve to both stabilize the oligonucleotide as well as 
encoding a motif such as the cell transduction motif of 'TAT', for example, which permits 
the penetration of the cytidine deaminase inhibitor into cells. For example, the 
methodology to establish a peptide in covalent linkage to an oligonucleotide has been 
described by Glen Research (http://www.glenresearch.com) (Example 10). In another 
25 aspect, the cytidine deaminase inhibitor can be a non-peptidyl-oligo composed of an inert 
non-peptidyl based mimic. For example, any compound possessing one or non-protease 
digestable amino acid linkages (i.e., inert peptide bond) can be used in this aspect. The 
synthesis of the non-peptidyl-oligos can be performed using similar techniques described 
above for producing peptide-oligos. Examples of molecules having one or more inert 
30 peptide bonds are well known in the art and can be found, for example, in the HIV-1 

protease inhibitor field. In one aspect, the non-peptidyl based mimic UIC-94017 disclosed 
in Koh et aL 9 Antimicrobial Agents and Chemotherapy, vol. 47, no. 10, p. 3123-3129, 
October 2003, can be used herein. 
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In one example, nanoparticles are used such as those described by Lambert G, et al. 
(Drug Deliv Rev. 2001 ;47, 99-1 12). Lambert et al. reviewed various assemblies to deliver 
antisense oligonucleotides. For example, in vivo, polyalkylcyanoacrylate (PACA) 
nanoparticles were able to efficiently distribute the antisense molecules to the liver. 
5 Alginate nanosponges can concentrate the antisense molecules in the lungs. Antisense 

loaded to PACA nanoparticles were able to improve the treatment of RAS cells expressing 
the point mutated Ha-ras gene in mice. 

The composition can include active components that can be delivered to a subject 
for which delivery to the intestine is desired. Examples of such additional components are 

10 numerous and are well known in the art. For example, there have been described various 
forms of composites which make possible active ingredients to be delivered to the intestine 
including for example those in the form of capsule (Japanese Patent Laid-Open Publication 
Nos. 41422/1992, 225922/1992, 179618/1994, and 327634/1995), those coated with 
monolayer (Japanese Patent Laid-Open Publication Nos. 368321/1992, and 2701/1995), and 

15 those coated with chitosan and a special polymer (Japanese Patent Laid-Open Publication 
Nos. 34927/1991, 69333/1992, and 217924/1992), herein incorporated in their entirety for 
methods of delivery to the intestine. 

Intestinal absorption enhancers can be used as well. These fall within the following 
general classes: (1) calcium chelators, such as citrate and EDTA; and (2) surfactants, such 

20 as sodium dodecyl sulfate, bile salts, palmitoylcarnitine, and sodium salts of fatty acids, for 
example. 

Parenteral administration of the composition, if used, is generally characterized by 
injection. Injectables can be prepared in conventional forms, either as liquid solutions or 
suspensions, solid forms suitable for solution of suspension in liquid prior to injection, or as 
25 emulsions. A more recently revised approach for parenteral administration involves use of 
a slow release or sustained release system such that a constant dosage is maintained. See, 
e.g., U.S. Patent No. 3,610,795, which is incorporated by reference herein in its entirety for 
the methods taught. 

The compositions may be in solution or in suspension (for example, incorporated 
30 into microparticles, liposomes, or cells). These compositions may be targeted to a particular 
cell type via antibodies, receptors, or receptor ligands. The following references are 
examples of the use of this technology to target specific proteins to given tissue (Senter, et 
al., Bioconjugate Chem., 2:447-451, (1991); Bagshawe, K.D., (1989) Br. J. Cancer, 60, 
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275-281,; Bagshawe, et al., (1988) Br. J. Cancer, 58, 700-703,; Senter, et al., (1993) 
Bioconjugate Chem., 4, 3-9,; Battelli, et al., (1992) Cancer Immunol. Immunother., 35, 421- 
425,; Pietersz and McKenzie, (1992) Immunolog. Reviews, 129, 57-80,; and Roffler, et aL, 
(1991) Biochem. Pharmacol, 42, 2062-2065,). Vehicles such as "stealth" and other 
5 antibody conjugated liposomes (including lipid mediated drug targeting to colonic 

carcinoma), receptor mediated targeting of DNA through cell specific ligands, lymphocyte 
directed tumor targeting, and highly specific therapeutic retroviral targeting of murine 
glioma cells in vivo. In general, receptors are involved in pathways of endocytosis, either 
constitutive or ligand induced. These receptors cluster in clathrin-coated pits, enter the cell 

10 via clathrin-coated vesicles, pass through an acidified endosome in which the receptors are 
sorted, and then either recycle to the cell surface, become stored intracellularly, or are 
degraded in lysosomes. The internalization pathways serve a variety of functions, such as 
nutrient uptake, removal of activated proteins, clearance of macromolecules, opportunistic 
entry of viruses and toxins, dissociation and degradation of ligand, and receptor-level 

1 5 regulation. Many receptors follow more than one intracellular pathway, depending on the 
cell type, receptor concentration, type of ligand, ligand valency, and ligand concentration. 
Molecular and cellular mechanisms of receptor-mediated endocytosis has been reviewed 
(Brown and Greene, (1991) DNA and Cell Biology 10, 6, 399-409). 
Pharmaceutical^ Acceptable Carriers 

20 Delivery of the cytidine deaminase inhibitors can be used therapeutically in 

combination with a pharmaceutically acceptable carrier. Pharmaceutical carriers are known 
to those skilled in the art. These most typically would be standard carriers for 
administration of drugs to humans, including solutions such as sterile water, saline, and 
buffered solutions at physiological pH. The compositions can be administered 

25 intramuscularly or subcutaneously. Other compounds will be administered according to 
standard procedures used by those skilled in the art. 

Pharmaceutical compositions may include carriers, thickeners, diluents, buffers, 
preservatives, surface active agents and the like in addition to the molecule of choice. 
Pharmaceutical compositions may also include one or more active ingredients such as 

30 antimicrobial agents, anti-inflammatory agents, anesthetics, and the like. 

The pharmaceutical composition may be administered in a number of ways depending 
on whether local or systemic treatment is desired, and on the area to be treated. Administration 
may be topically (including opthamalically, vaginally, rectally, intranasally), orally, by 
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inhalation, or parenterally, for example by intravenous drip, subcutaneous, intraperitoneal or 
intramuscular injection. The disclosed compounds can be administered intravenously, 
intraperitoneally, intramuscularly, subcutaneously, intracavity, or transdermally. 

Preparations for parenteral administration include sterile aqueous or non-aqueous 
5 solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene 
glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters 
such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous solutions, 
emulsions or suspensions, including saline and buffered media. Parenteral vehicles include 
sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated 

10 Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, 
electrolyte replenishers (such as those based on Ringer's dextrose), and the like. 
Preservatives and other additives may also be present such as, for example, antimicrobials, 
anti-oxidants, chelating agents, and inert gases and the like. 

Formulations for topical administration may include ointments, lotions, creams, gels, 

15 drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, 
aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. 

Compositions for oral administration include powders or granules, suspensions or 
solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, 
diluents, emulsifiers, dispersing aids or binders may be desirable. 

20 Some of the compositions may potentially be administered as a pharmaceutically 

acceptable acid- or base- addition salt, formed by reaction with inorganic acids such as 
hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric 
acid, and phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, 
glycolic acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, 

25 and fumaric acid, or by reaction with an inorganic base such as sodium hydroxide, 

ammonium hydroxide, potassium hydroxide, and organic bases such as mono-, di-, trialkyl 
and aryl amines and substituted ethanolamines. 
Therapeutic Uses 

The dosage ranges for the administration of the compositions are those large enough 
30 to produce the desired effect in which the symptoms of the disorder are affected. The 
dosage should not be so large as to cause adverse side effects, such as unwanted cross- 
reactions, anaphylactic reactions, and the like. Generally, the dosage will vary with the age, 
condition, sex and extent of the disease in the patient and can be determined by one of skill 

51 



WO 2005/115410 



PCT/US2005/016001 



in the art. The dosage can be adjusted by the individual physician in the event of any 
contraindications. Dosage can vary, and can be administered in one or more dose 
administrations daily, for one or several days. 

Cytidine deaminase inhibitors that do not have a specific pharmaceutical function, 
5 but which may be used for tracking changes within cellular chromosomes or for the delivery 
of diagnostic tools for example can be delivered in ways similar to those described for the 
pharmaceutical products. 

As described previously, cytidine deaminase inhibitors can be administered together 
with other forms of therapy. For example, the molecules can be administered with 
10 antibodies, antibiotics, or other cancer treatment protocols as described above, or viral 

vectors. When the cytidine deaminase inhibitor is in a vector, as described above, the vector 
containing the nucleic acid for therapeutic purposes can also contain the cytidine deaminase 
inhibitor. 

Also disclosed are cells comprising a cytidine deaminase inhibitor or comprising a 

1 5 nucleic acid that encodes the cytidine deaminase inhibitor, such as those described above. 
Disclosed are cell lines comprising a cytidine deaminase inhibitor. The cytidine deaminase 
inhibitor can be contained in a vector, or can be part of the genome of the cell. The cytidine 
deaminase inhibitor can comprise a nucleoside incorporated in a polymeric substrate, 
wherein the polymeric substrate targets a cytidine deaminase or a cytidine deaminase 

20 auxiliary protein. The cytidine deaminase inhibitor can be any of those cytidine deaminase 
inhibitors described above. 

The disclosed cell lines can be used in a variety of ways. For example, they can be 
used as tools to study cytidine deaminases. The cell line can be monitored in vitro for the 
inhibitory activity of cytidine deaminase inhibitors. The cell line can also be used for 

25 screening drugs that inhibit the efficacy of the cytidine deaminase inhibitor. Alternatively, 
cells of the cell line can be administered to a test animal and monitored for the effect 
thereof. The cell line can be used for drug discovery and for drug validation. Substances that 
are known or suspected of interacting with cytidine deaminase inhibitors can be 
administered to the cell line, and the effects thereof monitored. Combinations of the above 

30 can also be used to monitor the cause and effect relationship of various drug candidates. The 
disclosed cell lines can also be used as reagents to produce other beneficial cell lines, by for 
example, allowing the cell lines to multiply. These cell lines are useful as model systems 
for drug discovery and validation. 
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Methods of using the compositions 

Disclosed are methods of inhibiting cytidine deaminases comprising contacting a 
cell containing the cytidine deaminase with a cytidine deaminase inhibitor under conditions 
that allow the inhibitor to inhibit the deaminase function of the cytidine deaminase. The step 
5 of contacting the cytidine deaminase with a Ccytidine deaminase inhibitor can occur either 
in vivo or in vitro. 

By "inhibiting cytidine deaminases" is meant inhibiting the functionality of the 
enzyme, or inhibiting its ability to deaminate nucleic acids. The cytidine deaminase can be 
inhibited at 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80% or 90%, or 100% or greater or any 

10 amount in between, compared to a control cytidine deaminase or the normal functionality of 
the cytidine deaminase either in vivo or in vitro. 

Also disclosed is a method of treating a subject with a disorder associated with a 
cytidine deaminase comprising administering to the subject a cytidine deaminase inhibitor 
as taught herein. The disorder associated with cytidine deaminase can be a disorder 

15 associated with AID. Such disorders include, but are not limited to, B cell malignancies 

such as diffuse large B cell lymphomas and chronic lymphatic leukemia. The disorder can 
also be a T cell lymphoma or a pulmonary adenoma. 

The disorder associated with cytidine deaminase can also be a disorder associated 
with APOBEC-1. Such disorders include, but are not limited to cancers such as colorectal 

20 cancer, neurofibromatosis, atherosclerosis, steatosis (fatty liver disease) and metabolic 
syndrome. 

The disorder can also be associated with CEM15. Such disorders include, but are not 
limited to, cancers such as breast cancer. 

As discussed above, also disclosed are methods of promoting viral vector 
25 incorporation in a subject comprising the step of administering to the subject a therapeutic 
amount of a cytidine deaminase inhibitor. The cytidine deaminase inhibitor can be any 
cCytidine deaminase inhibitors described herein. 

Disclosed are methods of promoting incorporation of a viral vector in a subject 
comprising the steps of administering to the subject a therapeutic amount of a cytidine 
30 deaminase inhibitor and administering the viral vector to the subject. The cytidine 

deaminase inhibitor can be any of those cytidine deaminase inhibitors described above. 

Cytidine deaminase inhibitors can induce nucleotide deamination in the sequence of 
the retroviral vector itself either during its reverse transcription, relpication and/or during 
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the expression of RNA from the transgene after the virus integrated into the cell's 
chromosomes. The gene that the retrovirus is carrying is mutated as well, thereby reducing 
or eliminating functional protein expression (an effect that appears as low efficiency of 
transduction by the virus). Also as described above, the sites that are affected are those that 
are exact or approximate target sites that enable binding of CEM15 directly or through an 
auxiliary protein of CEM15. The partial or total mutation of Cs (or A's as would be the 
case of ADAR1) blunts the ability of the retroviruses to express functional proteins from its 
own genes and the gene(s) it carries and therefore the retrovirus is not able to integrate into 
the cells' chromsomes. 

Also disclosed is a method of screening for a cytidine deaminase inhibitor 
comprising the steps of contacting a cytidine deaminase with an agent to be screened, 
wherein the agent is incorporated in a polymeric substrate, wherein the polymeric substrate 
targets the cytidine deaminase; and determining the level of cytidine deaminase activity. A 
reduction in cytidine deaminase activity, as compared to cytidine deaminase activity in the 
absence of the agent to be screened, indicating a cytidine deaminase inhibitor. A reduction 
in cytidine deaminase activity is defined as less than 10%, 20%, 30%, 40%, 50%, 60%, 
70%, 80%, 90%, 100% or greater or any amount in between reduction in activity compared 
to a control or to the normal level of activity ascribed to the cytidine deaminase. 

In the screening methods, the cytidine deaminase can be in a cell, or can be in a 
transgenic animal. The method of screening can occur either in vivo, ex vivo, or in vitro. 
The polymeric substrate can comprise oligonucleotide residues. The polymeric substrate can 
be a nucleic acid that specifically targets the cytidine deaminase. Cytidine deaminase 
activity can be determined using a variety of techniques known in the art and taught by the 
following examples. 

The present invention is more particularly described in the following examples 
which are intended as illustrative only since numerous modifications and variations therein 
will be apparent to those skilled in the art. 

EXAMPLES 

Example 1: Methods for synthesizing Context Dependent Inhibitors containing 
zebularine 

Sources of Deoxy Zebularine Phosphoramidite: synthesis of deoxyzebularine 
phosphoramidite (DNA) is known in the art (Cheng, J.C., et al., J Natl Cancer Inst, 
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95(5):399-409, 2003). Also, 5-methyl-2'-deoxyzebularine is commercially available from 
Glen Research. The base is rather unstable at high pH during which the exocylic amine 
undergoes deblocking at the end of the synthesis. Hence, the synthesis needs "ultramild CE 
phosphoramidites," which can be purchased from Glen Research (http://www.glenres.com). 
5 There are the following options for synthesis: DNA Phosphoramidites, CE 

phenoxyacetyl (Pac) protected dA, CE acetyl (Ac) protected dC, and CE 4-isopropyl- 
phenoxyacetyl (iPr-Pac) protected dG (thymine/uridine do not need to be deblocked since 
there is no exocylic amine). 

The forementioned method was employed in the synthesis of a context dependent 

10 inhibitor of DNA harboring 5-methyl-2'-deoxyzebularine. As proof of principle, a 

sequence was synthesized was a 15-mer comprising: 5'-d(AGC-TAG-(dmZ)-TAA-GTT- 
AT)-3' (SEQ ID NO: 22), where one of the edited positions has been replaced by 5-methyl- 
2'-deoxyzebularine denoted (dmZ). This product represents a substrate for the enzyme 
activation induced deaminase (AID) as reported by Goodman and colleagues (Bransteitter, 

15 R. et al (2003) Proc. Natl. Acad. Sci. USA 100, 4102-4107). The phosphoramidites used 
were those of the ultramild class, which were purchased from Glen Research. The 
phosphoramidites were coupled on a 1 (amole scale using a DNA Synthesizer with coupling 
times and procedures recommended by Glen Research for mild phosphoramidites. The 
DNA product was deblocked and the trityl group removed under mild conditions described 

20 by Glen Research. The partially lyophilized product was resuspended in 0.1 M 

Triethylammonium acetate (TEAA) buffer pH 7.0 (Buffer A) and subjected to HPLC 
purification using a 1.9 x 30 cm |i-Bondapack CI 8 column (Waters) operated at 7 ml min" 1 . 
The product eluted as a single peak (Fig. 12) with a retention time of 44.3 min; failure 
sequences and impurities were clearly separated in this step.. The ion pairing elution buffer 

25 comprised Buffer A with 50% acetonitrile (Buffer B). The percent of Buffer B upon elution 
was 26%. The gradient started at 15% Buffer A and ended at 30% Buffer A in 70 min. Peak 
fractions were pooled and the pure product was lyophilized and desalted on a SepPak CI 8 
cartridge (Waters). The DNA was detected at 260 nm and the pooled material was 
lyophilized to dryness. The yield was 25% (nearly 2 mg). The sample was resuspended in 

30 water and analyzed by MALDI/TOF mass spectrometry (Fig. 13). The major peak was 
observed at an m/z of 4532. This result demonstrates the existence of an oligomer of the 
correct MW that represents the largest component of the sample. 
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As another option to make RNA inhibitors containing 2 , -OMe groups, the following 
options are available, although the deoxyzebularine must be used at the site of C-to-U 
conversion: 2'-OMe RNA Phosphoramidites, phenoxyacetyl (Pac) protected 2 f -OMe-A, 
acetyl (Ac) protected 2 , -OMe-C, and 4-isopropyl-phenoxyacetyl (iPr-Pac) protected 2'- 
5 OMe-G (regular 2'-OMe-uridine can be used). 

The Cap A reagent to phenoxyacetic anhydride needs to be changed to be 
compatible with ultramild bases, and used with an "ultramild" solid phase support (page 22- 
23 of the Glen Catalog on the internet). 2'-OMe-U can be purchased as a starting material 
but 2'-OMe CPG has to be synthesized. Using these mild reagents allows the ammonia 

10 deblocking step to be reduced to 4 hours at room temp. On the Glen Research website, 
0.05M potassium carbonate in methanol is recommended. Alternatively, ethanolic 
ammonium hydroxide can be used. 

Apart from the above, no changes are needed to the synthesis. The zebularine 
phosphoramidite can be dissolved at the normal concentrations (0.1M), filtered, and coupled 

15 for the same time as a normal deoxy phosphoramidite. However, careful purification is 

needed as the mild ammonia reduces, but does not eliminate, destruction of the zebularine 
base. "DMT-on" purification can be performed, along with initial purification on reverse- 
phase HPLC (CI 8) with buffers as follows: A: 0.1 M triethylammonium acetate pH 6.5 + 
5% CH 3 CN; B: 0.1 M triethylammonium acetate pH 6.5 + 65 % CH 3 CN; and 25 to 65 % B, 

20 over 30 minutes. The desired DMT-on product elutes at about 12-15 minutes (failure 
sequences, which lack the hydrophobic group elute in the void volume). 

The product is collected in bulk based on absorption at 260. nm, lyophilized to near 
dryness at -95° C to remove TEAA buffer and the DMT group hydrolized by incubating in 
80% acetic acid/20% water for 30 minutes. The solution is stopped by freezing in liquid N2 

25 and acetic acid can be removed by lyophilization; adding 1 ml of water and evaporating 
(repeat 3 times) removes all the acid). 

A normal oligonucleotide is pure at this stage, however this is not true with zeb- 
derivatives. A second reverse phase HPLC purification is required with a shallower 
gradient; 0-25% buffer B, over 30 minutes. A cluster of closely running peaks elute at 

30 around 10-15 minutes. The desired zeb product is usually the biggest, slowest running peak. 
Identity can be confirmed by MALDI-TOF MS or UV absorbance/fluorescence 
spectroscopy, zebularine absorbs at -305 nm and zebularine oligos contain a shoulder 
(sometimes not very prominent) at this wavelength. Zebularine is also fluorescent, 
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excitation at 305 gives fluorescence at about 390 nm. Yields are usually 20-30% compared 
to an oligonucleotide of the same length with only normal bases. 

The phosphoramidite is stored desiccated at minus 20°C (stable forever). Likewise 
the final zebularine-oligos are stable when stored frozen at -20°C. 

5 

Example 2: Alternate methods for synthesizing cytidine deaminase inhibitors 

One compound useful as a cytidine deaminase inhibitor can be 4-fluoro zebularine. 
One advantage of such an inhibitor is that it is more reactive than the parent compound. 
Conceptually, the nucleoside should hydrate at C4 like zebularine and 5-fluorozebularine. 

10 However, the hydrated tetrahedral carbon has F instead of H, and since F is more an isostere 
for the hyrdoxyl, the compound may not bind effectively at the active site. Other 
substitutions at 2' are designed to influence the ribose sugar pucker to more closely mimic 
the conformers of RNA, as well as their hydrogen bonding properties. In this respect, 2'- 
Fluoro can be used. Due to its inert nature, this position does not require protection during 

15 chemical synthesis. 

Example 3: Reduction in Small Intestinal apoB mRNA Editing to Reduce B48 
Containing Chylomicron Remnants in Patients with Metabolic Syndrome. 

The nucleotide analog deoxyzebularine, in the context of single-stranded (ss) RNA 
20 or PNA oligonucleotides comparable to SEQ ED NO: 12, can be used to reduce apoB 

mRNA editing in the small intestine. Cholesterol is carried in the blood from one tissue to 
another as lipoprotein particles by specific carrier proteins called apolipoproteins. 
Apolipoprotein B (apoB) is an integral and non-exchangeable structural component of 
lipoprotein particles referred to as chylomicrons, very-low-density lipoprotein (VLDL) and 
25 low-density lipoprotein (LDL). ApoB circulates in human plasma as two isoforms, 

ApoB 100 and apoB48. ApoB 100 can be converted into ApoB48 by the enzyme APOBEC- 
1 (ApoB mRNA editing catalytic subunit-1). With the help of auxiliary factors, APOBEC-1 
can stop the synthesis (production) of ApoB 100 form and start the synthesis of ApoB48 
form in cells. ApoB 100 and ApoB48 play different roles in lipid metabolism. Various 
30 studies indicated that ApoBlOO-associated lipoproteins (VLDL and LDL) are much more 
atherogenic than ApoB48-associated lipoproteins. 

Stimulating hepatic ApoB mRNA editing is a way to reduce serum LDL through the 
reduction in synthesis and secretion of ApoB 100 containing VLDL. In most mammals 
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(including humans), ApoB mRNA editing is carried out only in the small intestine. Studies 
found that the presence of substantial editing in liver is associated with a less atherogenic 
lipoprotein profile compared with animals that do not have liver editing activity. APOBEC- 
1 is expressed in all tissues that carry out ApoB mRNA editing. Human liver does not 
5 express APOBEC-1 but it does express sufficient auxiliary proteins to complement 

exogenous APOBEC-1 in ApoB mRNA editing in transfected cells. Therefore, induction of 
editing in human liver reduces apoBlOO synthesis and thereby reduce the levels of 
circulating LDL cholesterol. 

10 Example 4: AID and Context Dependent Inhibitors 

The nucleotide analog deoxyzebularine [(l-P-D-2'-deoxyribofuranosyl) pyrimidin- 
2-one, dZ], in the context of single-stranded (ss) DNA oligonucleotides (or those 
comparable to Fig. 5) bearing SHM hotspots, is tested as a specific inhibitor of AID, while 
being inactive on deoxycytidine deaminases (dCDA) essential for pyrimidine metabolism. 

15 The in vivo substrate specificity of AID has been already established as the deamination of 
deoxycytidine (dC) at N4 in the context of single-stranded, but not doublestranded (ds) 
DNA (Bransteitter, R., et al., (2003) PNAS USA, 100, 4102-7,; Pham, P., et al. (2003) 
Nature, 424, 103-7; (; Ramiro, A.R., et al., (2003) Nat Immunol, 4, 452-6,; Chaudhuri, J., et 
al., (2003) Nature, 422, 726-30,; Dickerson, S.K., et al., (2003) J Exp Med, 197, 1291-6,). 

20 Furthermore, ssDNA deamination demonstrates activity on the SHM consensus sequence 
WRCY (SEQ ID NO: 19), where W = A or T, R = purine and Y = pyrimidine. By 
substituting dC with dZ in the context hot spot ssDNA oligos, selectively "poisoning" AID 
by inhibiting oligo release from its active site is possible. Previously, the use of free dZ and 
Z were shown to be intracellular inhibitors of mammalian deoxycytidine deaminases and 

25 cytidine deaminases involved in pyrimidine metabolism and revealed anti-tumor properties 
Barchi, J.J., Jr., et al., (2003) J Enzyme Inhib, 9, : 147-62, ). Enzymatic inhibition by dZ is 
derived not only from its ability to mimic the structure and hydrogen bonding properties of 
dC, but also through its reactivity with the Zn 2+ -dependent enzyme to form 3,4- 
dihydrodeoxyouridine, a mechanism based transition state mimic that binds the cytidine 

30 deaminase active site with a Ki of -10- 12 M. Furthermore, in the context of dsDNA, dZ 

forms a covalent adduct with DNA methyl transferases Wedekind, J.E., et al., (2003) Trends 
in Genet, 19, 207-16,. However, because ssDNAs are targeted by neither deoxycytidine 
deaminases, cytidine deaminases nor DNA methyl transferases, an oligo of the form 5'- 
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d(XnWRZYXn)-3' (SEQ ID NO: 20) selectively inhibits AID (where X = any nucleotide 
and n = multiple variable sequences, the nature of which depends on the context required by 
each deaminase or its auxiliary protein, 'n' is an integer to reflect the length of any flanking 
nucleotides of identity X. 
5 DNA oligonucleotides are chemically synthesized by use of an Applied Biosystems 

392 DNA/RNA synthesizer. Due to the lability of zebularine, methods employ Ultramild 
CE deoxyphosphoramidites (Glen Research, VA) following the recommendations of the 
manufacturer. The 2'-deoxyzebularine phosphoramidite can be synthesized by the methods 
of Zhou and coworkers (Zhou, L., et al., (2002) J Mol Biol, 321, 591-9, and refs therein). 

10 DNA products are purified by reverse phase HPLC as the DMT-on and DMT-off adducts, 
as described by Wedekind and McKay (Wedekind, J.E. and D.B. McKay (2000) Methods 
Enzymol, 317,149-68). 

Sequences of dZ-containing oligos are derived from previous work that 
demonstrated AID is active on ssDNA bearing the SHM hotspot sequence WRCY. 

15 Therefore, test and control DNA oligos are of the form: 5'- 

(RGTW)n[WR(C/Z)Y](RGTW)n 3' (SEQ ID NO: 21), where n = 1, 2, 3 or 4; sequences are 
chosen to minimize secondary structure based on thermodynamic stability calculated with 
the DNA option of program RNA structure (Mathews, D.H. et al., (2002) J Mol Biol, 317, 
191-203,). AID expressing reporter cell lines is cultured for 1 hr in serum-free medium in 

20 the absence or presence of dC- or dZ-containing oligonucleotides at concentrations lOnM- 
25 ^M, and then transferred back into complete medium. SHM activity is measured by 
GFP reporter reversion rates; residual toxicity is assessed by measuring cell viability by 
propidium iodide exclusion. Verification of in vitro activity is followed by in vivo studies 
on transgenic mice (Jansen, B. and U. Zangemeister-Wittke (2002) Lancet Oncol, 3, 672- 

25 83; Juliano, R.L. et al., (2000) Curr Opin Mol Ther, 2, 297-303) data are available for 

optimal delivery conditions of antisense nucleotides in mice, resulting in efficient delivery 
to splenic B lymphocytes (Zhao, Q., et al., (1998) Antisense Nucleic Acid Drug Dev, 8, 451- 
8). 

30 Example 5: CEM1 5/APOBEC-3G and Context Dependent Inhibitors 

Cytidine deaminase inhibitors designed to target CEM15 are tested in cell culture for 
the purpose of improving retroviral infectivity, which has application to the expression of 
exogenous nucleic acids and application to laboratory infections. Viral infectivity is 
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measured by use of vif (+) and vif (-) pseudotype viruses and host 293T cells either lacking 
or expressing the Context Dependent Inhibitor target CEM15. The inhibitory effect of 
CEM15 on the infectivity of vif-f and vif- HIV-1 particles by transient co-transfection of 
appropriate HIV-1 proviral DNA and CEM15 expression plasmids has been established 
5 (Sheehy, A.M., et aL, (2000) Nature, 418, 646-50). An assay has been developed using a 
VSV G-protein pseudotyped lentiviral particles that corroborates prior results on CEM15 
infectivity and is amenable to the rapid demarcation of the regions of HIV-1 DNA (or RNA) 
that are the targets of CEM15 deaminase activity. This methodology can be used in the 
identification of exact cytidine deaminase inhibitor substrates. 

10 An Env-deleted HIV-1 proviral DNA vector (derived from pNL43; AIDS Reagent 

Repository) was modified by replacement of Nef with a GFP reporter gene and two in- 
frame stop codons were inserted that abolished Vif production (pHR-GFPAvif) (confirmed 
by western blotting with anti-Vif antibodies (AIDs Reagent Repository). Stable, HA-tagged 
CEM15 expressing 293T cell lines were selected with puromycin and verified by western 

15 blotting with a HA specific monoclonal antibody (HA.l 1; BabCo). The expression of 

similar levels of full-length HA-tagged CEM15 (or mutant derivative thereof) are assayed in 
stable cell lines. Although structural modeling can predict focused mutations that impair 
deaminase activity without destabilizing the entire protein, expression of the mutants must 
be verified; the addition of the HA epitope tag has no effect on the ability of CEM15 to 

20 suppress infectivity (Sheehy, A.M., et aL, (2000) Nature, 418, 646-50, 2000). Isogenic HIV- 
1 pro-viral DNAs are packaged into pseudotyped lentiviral particles by co-transfection with 
a plasmid encoding the VSV G-protein into 293T cells that lack endogenous CEM15 (-) or 
expressed wild type CEM15 (+) as controls. The resulting pseudotyped particles containing 
HIV-1 RNA of near full-length (with only a ~2kb deletion) were quantified by reverse 

25 transcriptase (RT) assay; presence of p24Gag protein content can also be assayed by ELISA 
to normalize viral particles. A defined number (1x10 s cpm of RT activity) of these particles 
were added to target, virus susceptible MT2 cells (5xl0 5 ). To assess their infectivity, the 
percentage of cells that expressed the GFP indicator gene encoded by the packaged 
recombinant HIV-1 genome was quantified 24 hours later by flow cytometry (University of 

30 Rochester Core Facility). The results (Fig. 6) indicate that the expression of CEM15 in 
293T cells resulted in at least a 100-fold decrease in vif (-) viral infectivity compared to 
particles generated in parental 293T cells. The low level of GFP expression from vif(-), 
CEM15+ particles is indistinguishable from background fluorescence in control cells 
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[6.2%]. This assay is amenable to the use of existing HIV-1 proviral isotyped vectors that 
are deleted for different regions and different amounts of the HIV-1 genome. Deleted genes 
can be provided in trans by co-transfection of suitable expression plasmids. A recent 
comprehensive examination of viral proteins and host tRNA Lys3 derived from vif (-) virions 
5 revealed no significant biochemical or priming defects (Gaddis, N.C., et al., (2003) J Virol, 
77,5810-20). 

The effect of cytidine deaminase inhibitor compounds on CEM15 can be tested by 
treating one or more of the cell types that express CEM15 with various concentrations of 
cytidine deaminase inhibitor (of the form in Fig. 5) followed by subsequent infection with 

10 HIV-1 pseudotyped virus containing reporter GFP. Within 48 hours post infection, cell 
culture supernatants containing viral particles are added to HeLa cells to test their 
infectivity, as evidenced by the appearance of green fluorescent cells in FACS analysis. 
Reduction or elimination of green fluorescent cells relative to that observed in infections 
from CEM15 (+)cells HIV-1 vif (-) that were not treated with cytidine deaminase inhibitor 

1 5 compounds are scored as a positive identification of cytidine deaminase inhibitor inhibition 
of CEM15 deaminase activity. Inhibition of GPF reversion by CEM15 is analogous to 
CEM15 (+) vif (+) cell/virus phenotype. 

Example 6: AID activity and SHM specificity in human lymphoma cell lines 

20 High levels of SHM on reporter constructs are strongly dependent on the presence of 

Ig cis-acting regulatory elements, such as the Ejj. and Ek enhancers (Betz AG, et al & 
Neuberger MS. 1994. Cell 77: 239-248; Bachl J, & Wabl M. 1996 Immunogenetics 45: 59- 
64; Bachl J et al.1998. Proc Natl Acad Sci USA95: 2396-2399 In particular, a GFP-mutant 
reversion substrate was shown to mutate over an order of magnitude more efficiently in the 

25 presence of a linked E|li enhancer sequence (Bachl, et al., 1999 Eur J Immunol 166, 5051- 
64). This context-dependent specificity of SHM targeting is exploited to test whether 
lymphoma lines with different phenotypes also display differences in their Ig SHM 
targeting ability. 

A number of lymphoma cell lines have been obtained including SUDHL-4 and -6, 
30 OCI Ly-3, -7, -10 and -19. Preliminary RT-PCR expression data on a subset of the lines 
show that they express different levels of AID mRNA (Fig.l; wt, 646 bp band); PCR 
products of different size are also found in at least 4 of 6 lines tested. Interestingly, the 
Farage line, which is known not to undergo Ig SHM in vitro, displays predominantly the 
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smaller PCR products with little if any wild-type. Two of the alternative products (~ 940 bp 
and 530 bp) correspond to the expected sizes for two of the trAED isoforms already 
described, while the smallest 2 bands are novel. These PCR products are cloned and further 
characterized. 

5 The role of AID and its isoforms is investigated in this set of lymphoma lines, using 

assays capable of distinguishing between Ig-targeted and non-Ig-targeted SHM. A modified 
SHM GFP reversion assay (Bachl J (1999) Eur. J. Immunol 29, 1383-1389; Gabay C. 
(1999) Eur J Haematol 63, 180-191; Bachl J, et al., (2001) J Immunol 166, 5051-5057) has 
been developed. In this highly sensitive and quantitative assay, a mutation creating a stop 
10 codon embedded in a SHM hotspot sequence has been introduced in the GFP gene; as SHM 
targets this sequence, reversion occurs and the reverted cells are detected by their 
fluorescence. 

The assay is modified so that the mutated GFP coding sequence from plasmid pi 
was transferred under the control of the mouse strong, ubiquitous promoter from the 

15 phosphoglycerate kinase- 1 (pgk) gene, to generate the pgk-GFP* construct. A 0.7 kb 

fragment containing the entire E/i core enhancer was inserted 3' of the pgk polyadenylation 
site in the variant construct pgk-GFP*-Ep,. These constructs are stably transfected by 
electroporation in the lymphoma lines as well as the following control lines: - NIH3T3 wild 
type, as a negative control; - NIH3T3 - AID transfectant, as positive control for pgk-GFP* 

20 reversion; the hypermutating murine pre-B cell line 18.81 (as positive control for Ig-specific 
SHM, pgk-GFP*-Eja reversion). 

Cotransfection of a pgk-neo r antibiotic resistance gene followed by G418 selection 
is used to identify stably transfected clones. Presence of the GFP gene is assessed by PCR 
with GFP-specific primers. Individual clones are isolated and expanded (3-4/transfection, 

25 tested for comparable transcriptional activity), but large mixed pools of >50 transfectants 
are also maintained, which are analyzed in parallel to further control for potential 
integration-dependent effects and other sources of heterogeneity in the clonal transfectants. 
Reversion rates of reversion are quantified by flow cytometry at different time points, and 
the SHM activity on E|a-bearing and non-E|a-bearing plasmids compared as described by 

30 Bachl and colleagues. Sequencing of PCR-amplified GFP fragments from flow-sorted 
revertant GFP+ cells are used to further characterize the frequency and features of the 
mutations occurring in the individual lines. 
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Primary GC-like DLBCL samples have been shown to often display ongoing Ig 
SHM, while ABC-like DLBCL do not (Lossos IS, et al., (2000) Proc. Natl Acad. ScL USA 
97, 10209-10213). However, this is not necessarily the case for the culture-adapted 
lymphoma lines described above, which could have diverged from their original 
5 phenotypes. Indeed, if non-specific SHM does promote a mutator-like phenotype, it can be 
expected to emerge during long-term culture even in lines with germinal center origin. 
Differences among cell lines in the targeting of SHM can be detected, with some lines 
showing Ig-specific SHM (reversion rate of pgk-GFP*-E|j, » pgk-GFP*), and others non- 
Ig-specific SHM (reversion rate of pgk-GFP*-Ep, ~ pgk-GFP*). 

10 Consistent with data indicating that the Farage line has no ongoing Ig SHM, this line 

expresses little wtABD mRNA, but instead several shorter isoforms. The Toledo line is 
known to display significant Ig variable gene heterogeneity, indicative of ongoing, Ig- 
targeted SHM. By analyzing lines with a range of SHM phenotypes, the role of AID in 
determining SHM specificity can be established. In particular, the extent by which a 

15 mutator phenotype is present in lymphoma lines can be established, and its relationship with 
the pattern of wtAID and trAID expression elucidated. Moreover, these same cell lines 
provide valuable model systems for other experiments described below. Lines displaying 
normal wtAID expression but lack of SHM specificity, suggest defects in AID accessory 
factors. Such lines represent useful models for the identification of such factors. 

20 

Example 7: Effect of AID isoforms on SHM targeting 

In some primary CLL samples, lack of Ig gene SHM is accompanied by the 
expression of unusual alternatively spliced forms of AID mRNA; similar mRNAs are also 
found in some, but not all, lymphomas. If translated, these mRNAs give rise to truncated 

25 AID isoforms with extensive alterations or loss of the pseudocatalytic domain. It appears 
that some of these isoforms affect SHM targeting specificity. This could occur by two 
alternative mechanisms, either the isoforms act directly as non-specific mutators, or they do 
so by interfering with wtAID function, by generating heterodimers that are perhaps 
incapable of properly associating with targeting factors. 

30 cDNAs from human wtAID and for presumptive trAID isoforms have already been 

cloned as described above; if the variant AID transcripts already identified is found to differ 
from the expected, CLL sample mRNAs are utilized to isolate the correct cDNAs. All RT- 
PCR products are first modified to bear protein tags for direct detection by shuttling them 
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into appropriate plasmid vectors (6xHis/Xpress- and HA-tag vectors are readily available). 
Different tags are appended to wtAID vs. trAIDs, to allow for specific immunoprecipitation 
experiments. The tagged cDNAs are then cloned into retroviral vectors. An effective 
system for retroviral delivery of wtAID using the MSCV-derived pMIG-IRES-GFP 
5 bicistronic vector has been established. 

Because the AID functional assays utilize a GFP gain of function reporter, the 
retroviral vector is modified by replacing GFP with the dsRed2 fluorescent protein coding 
sequence (Clontech), thereby enabling AID expression detection by immunofluorescent 
microscopy and flow cytometry in the red channel (pMIG-IRES-Red). Thus, pMIG-TRES- 
10 GFP (wtAID) and -Red (both wtAID and trAIDs) vectors are generated. Correct expression 
of the expected products is tested by transduction of NIH3T3 cells followed by Western blot 
analysis. 

Functional assays on the individual proteins are conducted in pgk-GFP* and pgk- 
GFP*-EiiNIH3T3 reporter cells, transduced with the ds-Red expressing retroviral vectors. 
1 5 Transduced cells are analyzed by flow cytometry, with red fluorescence marking trAID- 
expressing cells, and green fluorescence showing revertants. Rates of reversion (GFP- 
positive/Red2 -positive) are compared between reporters (pgk-GFP* and pgk-GFP*-E|i) and 
between cells expressing wtAID or individual trAIDs. This reveals whether trAIDs retain 
any intrinsic activity. 

20 The ability of trAIDs to affect wtAID activity and SHM targeting is established by 

inducing their expression in cell lines already expressing wtAID. Lymphoid cell lines are 
used expressing endogenous AID and confirmed to undergo Ig-targeted SHM such as the 
murine 18.81 cell line and one of the lymphoma lines described above. These lines are 
transduced with the various trAID-Red-encoding retroviral vectors, and SHM specificity 

25 measured on the basis of pgk-GFP* and pgk-GFP*-E|n reversion rates as described above. 

If a dominant effect of trAIDs on wtAID activity is observed, such as loss of SHM 
specificity in double-expressors, an interaction of wtAID and trAIDs is evaluated. NIH3T3 
cells are simultaneously transduced with both pMIG-HA/wtAID-IRES-GFP and pMIG-His- 
Xpress/trAID-IRES-Red vectors; such procedure has been shown to successfully generate 

30 double-expressor cells. Double-fluorescent (green/red) cells are enriched by flow cytometry 
and expanded. To demonstrate heterodimerization, total cell protein lysates are prepared 
and reciprocal immunoprecipitation reactions are performed with anti-HA of anti-Xpress 
antibodies, according to the manufacturer's protocols. Immunoprecipitated products are 
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separated by SDS-PAGE gels, and blotted onto nitrocellulose filters. The filters are 
detected with either anti-HA or anti-Xpress antibodies, followed by an HRP-conjugated 
secondary antibody and visualization by the West Pico Super Signal Chemiluminescent 
substrate (Pierce). 

5 Unlike the murine lines described above, human lymphoma cell lines require a 

slightly more complex retroviral transduction protocol. Briefly, the pMIG retroviral vector 
DNAs are first transfected by lipofection into the Phoenix-ampho packaging cell line, and 
the retroviral supematants derived from this first packaging cycle are then used to transduce 
the RD1 14 packaging cell line, which generates retroviruses with a Feline Leukemia Virus 
10 envelope able to efficiently transduce human cells. This "ping-pong," sequential packaging 
method is required for maximum yields because the RD1 14 cell line does not transfect 
efficiently enough to be used directly. Retroviral supematants from the second packaging 
cycle are then used for human cell line transduction; the infection rate in human lymphoma 
lines routinely averages from 20 to over 80% (Fig.3). 

15 

Example 8: AID'S oncogenic role in B cell neoplasias 

The finding of T cell lymphomas and pulmonary adenomas in AID transgenics 
confirms the oncogenicity of AID. These transgenics did not acquire B cell malignancies, 
raising the question of whether AID expression in fact plays a pathogenetic role in B cell 

20 neoplasias. B cells can be tested to determine if they have the potential to be transformed 
through AID-mediated oncogenesis, whether this is associated with initiation of 
tumorigenesis, and/or if AID expression only contributes to neoplastic progression. 

In human activated B cell-like DLBCL, the presence of mutations in Ig genes 
suggests a post-germinal center origin. Sustained AID transcription in the absence of 

25 ongoing Ig SHM, coupled with the frequent finding of SHM-like oncogene mutations in 

these cells, clearly suggest a deregulation of AID targeting. To investigate AID's effects on 
B lymphocytes, as models of human lymphomagenesis, transgenic mice are generated that 
replicate the AID expression pattern observed in human ABC-like DLBCL. 

The immunoglobulin IgH 3' enhancer element (3 'EH) is involved in the regulation 

30 of immunoglobulin gene expression and class switch recombination during B cell activation 
and terminal B cell differentiation. When linked to an Ig variable region promoter, the 
3 'EH hs (1,2) element has been shown to reliably direct transgene expression specifically in 
activated B cells and plasma cells. A transgenic expression vector containing the murine 
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VH186.2 promoter and the 3'EH hs(l,2) enhancer is available. His-tagged wtAID cDNA is 
cloned into this vector (3'E-AID construct), and transgenic mice are generated by 
pronuclear microinjection of C57B1/6 zygotes. 

Transgenic mice are identified by PCR for the fusion gene containing AID cDNA 
5 and the human y-globin 3 'UTR/poly-A site present in the vector. Appropriate expression of 
the transgene in activated B cells will be established by RT-PCR on flow-sorted B cell 
subpopulations (resting, B220+,IgD hi IgM l0 B cells; germinal center B220+/GL7+ cells; 
plasma cells, large B220 10 , CD43+, slg- cells) and by immunohistochemistry on splenic 
section using the anti-Xpress mAb and counterstains for GC cells (PNA+) and plasma cells 

10 (cytoplasmic Ig hl ). Transgene inducibility are also verified in vitro by culture of primary 
splenic B cells with activators (LPS, 20 |^g/ml; anti-CD40 clone HM40-3, 2 |J.g/ml) known 
to induce 3'EH activity, followed by RT-PCR and Western blotting at different time points 
(day 0, 1, 2, 4 of activation). Transgenic AID activity are functionally assessed by breeding 
the 3'E-AED transgenics with AID-deficient mice; if AID activity is present, at least partial 

15 complementation of the defect in SHM and CSR is observed in these mice. 3'E-AID/AED- 
/- mice are evaluated for their ability to undergo SHM by VH region sequencing using 
primers for Ig VHJ558 segments following sheep red blood cell immunization, and class 
switching by ELISA on pre-and post-immunization serum. 

The ability of transgenic AID expression to induce B cell neoplasias is evaluated by 

20 monitoring the health status of transgenics as they age. Mice displaying signs of ongoing 
disease (cachexia, lack of activity, fur loss, hunched posture, etc) are sacrificed and their 
lymphoid tissues (bone marrow, spleen, peripheral and mesenteric lymph nodes and Peyer's 
patches) analyzed histologically and by flow cytometry with an array of antibodies specific 
to B cell- (CD19, CD21, CD23, IgM, IgD, IgicA,, GL-7), T cell- (CD3e, CD4, CD8) and 

25 myeloid markers (Mac-1, Gr-1). Signs of B lymphoid expansion can be monitored; if this is 
observed, emergent clonality is established by Southern blot analysis of Ig gene 
rearrangements in the tissue's genomic DNA. 

To distinguish potential effects of AID as a progression factor, transgenic mice are 
bred onto genetic backgrounds already predisposed to B cell lymphoma development. 

30 Initially, E(i-bcl2 transgenic mice are used. This transgene results in rare, late (>12 months) 
lymphomas on a normal background, but dramatically accelerates lymphomagenesis when 
combined with another transgenic oncogene, such as Eji-c-myc, suggesting that 
accumulation of multiple secondary mutagenic hits is required for disease progression in 
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these mice. By generating double transgenics, any potential effect of AID expression in 
accelerating lymphoma development is established. In addition, should the effect of AID in 
the bcl2 background be weak, breeding with Eji-c-myc transgenic mice, which display a 
more aggressive baseline phenotype, can also be used. 
5 3'EH-hs(l,2) -dependent constructs are known to reliably express in activated B 

cells. The availability of AID-deficient mice allows the unequivocal determination of 
transgene expression and activity. Evidence of B cell malignancies following sustained 
AID expression, either as a primary event (in a normal background), or as a progression 
factor (in lymphoma-susceptible backgrounds) confirms that B lineage cells can be targets 

10 of AID-mediated transformation, and provide a valuable model for the study of the 

mechanisms of human AID-mediated lymphomagenesis. Lack of a cancer phenotype also 
can represent an informative result, as it suggests either an intrinsic protection of B 
lymphocytes against SHM mistargeting, or more likely, based on the strong evidence for 
SHM-like oncogene mutations in human lymphomagenesis, a role for still unidentified 

1 5 SHM effectors downstream of AID. 



Example 9: Molecular identification of non-Ig gene AID targets in lymphomas. 

A small number of oncogenes (c-myc, Piml, Pax5, RhoH/TTF) have been found to 
bear hallmarks of SHM in human lymphoma samples. Additional important targets can 

20 exist whose mutation contributes to neoplastic development. In this experiment, a mutation 
screening method based on a genetic selection strategy that exploits bacterial DNA 
mismatch repair is used. This method has been used to identify single nucleotide 
polymorphism in human genomic DNA and has been modified herein. 

These experiments take advantage of the mismatch repair detection (MRD) system, 

25 a novel, high-throughput bacterial positive genetic selection strategy for human disease 

related single nucleotide polymorphisms. In this example, the selection system is used as it 
was originally intended for screen mismatches in genomic DNA sequences. 

Genomic DNA isolated from a non-B cell source (e.g. fibroblasts) and from 
lymphomas from AED-transgenic mice is digested with DpnII (average size ~0.3kb) and 

30 cloned separately into two different plasmids. Unmethylated plasmids (grown in a dam 

methylase-deficient E. coli strain) containing the 'control' inserts (from normal tissue DNA) 
also encode an intact Cre recombinase, whereas the methylated plasmids contain putative 
mutated fragments from lymphoma cells and encode an inactive 5 nucleotide deletion 
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mutant of Cre. Heteroduplexes formed in vitro between the two plasmid libraries by 
melting and reannealing are transformed into a bacterial strain that harbors an F' episome 
carrying a 'floxed 5 tetracycline resistance gene. Repair of the mismatch uses the methylated 
strand as template, resulting in loss of the functional Cre recombinase gene and retention of 
5 the 'floxed' tetracycline resistance gene. Non-mismatched heteroduplexes, instead, induce 
no repair, express functional Cre, and result in Tet R LoxP-mediated deletion. The Tet R 
clones obtained through the MRD process therefore contain exclusively fragments 
displaying sequence heterogeneity between the original samples, and will be subject to 
further selection and identification steps. 

10 

Example 10: Deaminase inhibitors as agents for AID-targeted 
pharmacological intervention 

AID, like other cytidine deaminases, is known to be inhibited by Zn 2+ chelation. It 
has previously been shown that two types of Zn 2+ chelators and metalloenzyme inhibitors, 

15 tetracyclines and hydroxamates, display a remarkable ability to inhibit CSR in vitro at 

therapeutic concentrations. Tetracyclines also inhibit antibody responses in vivo. Although 
AID mRNA expression was not affected by tetracycline treatment in vitro), the possibility 
exists that the observed inhibitory activity reflected an indirect effect on CSR, rather than a 
direct inhibition of AID function. Because both tetracyclines and hydroxamates (and 

20 especially the former) are available drugs which could, if necessary, be rapidly applied in a 
clinical setting, they can be tested to determine if they are capable of directly inhibiting AID 
function in the model systems described above. 

The same reporter cells lines and assays are used as described in Example 9. AID 
activity on the GFP reversion assay is assayed in cells cultured in the presence of increasing 

25 concentrations of doxycycline (1-10 |^g/ml) and the KB8301 hydroxamate (5-50 jiM) 
(Becton Dickinson). Importantly, the assays are performed on cells expressing both 
endogenous (18.81, lymphoma lines) as well as exogenous AID (NIH3T3 transfectants); 
this controls for the possibility that either drug can act on some other component of the B 
cell SHM/CSR activation pathway (which is absent in NIH3T3). Assuming inhibition will 

30 be observed in all systems, additional agents are tested, including other tetracyclines and 
FDA-approved non-antibiotic tetracycline analogs (CMT series, from Collagenex 
Pharmaceuticals), as well as hydroxamates approved for cancer therapy, such as 
marimastat/BB2516/TA2516 (British Biotech/Schering Plough). The optimal agents, based 
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on inhibitory activity and known pharmacological properties, can then be tested in the AID 
transgenic mice derived in Example 7, if appropriate, to establish their ability to arrest of 
delay tumor onset and/or progression. 

5 Example 11: Making Peptide-Oligo Conjugates 

One method of making a peptide-oligo conjugate is to use Fmoc chemistry and 
synthesize the peptide from an oligo synthesized on amino-CPG. Deprotection of peptides 
synthesized using Fmoc chemistry requires 50% TFA and t-boc synthesized peptides 
require HF. Another method is to use a heterobifunctional crosslinking reagent to link a 

10 synthetic peptide, containing an N-terminal lysine, to a 5 '-Thiol modified oligo or 

conversely a 5'-amino modified oligo to a cysteine containing peptide. An example of a 
crosslinking reagent is N-Maleimido-6-aminocaproyl- ^'-nitro^'-sulfonic acid)-phenyl 
ester . Na+ (mal-sac-HNSA) from Bachem Bioscience (cat. # Q-1615). Reaction of this 
crosslinker with an amino group releases the dianion phenolate, l-hydroxy-2-nitro -4- 

15 benzene sulfonic acid a yellow chromophore. The chromophore allows both quantitation of 
the coupling reaction as well as act as an aid in monitoring the seperation of ^activated 
peptide from free crosslinking reagent using gel filtration. 

Method A: Couple Peptide Amine To Oligo Thiol (peptide MW should be > 5,000 to be 

20 excluded from desalting column). 

The following is an example of one method that can be used to couple a peptide 
amine to an oligo thiol. Step 1 : Synthesize a peptide with an N-terminal, or internal, lysine 
(The epsilon amino group is more reactive than an alpha amino group). This can be done for 
the TAT peptide. Step 2: Synthesize an oligonucleotide with a 5 f Thiol group. These are 

25 available from Glen Research. Step 3: React peptide with excess mal-sac-HNSA (pH 7.5 
Sodium phosphate). Step 4: Separation of peptide-mal-sac conjugate from free crosslinker 
and buffer exchange (pH 6.0 Sodium phosphate) using a gel filtration column (NAP or eq.). 
(Peptide must be large enough to seperate from the free linker which can be visualized as a 
yellow band. Do not collect yellow band with peptide. Step 5: Activate thiol modified oligo, 

30 desalt and buffer exchange (pH 6 Sodium phosphate) on NAP 5 column. Step 6: React 

acitvated peptide with Thiol modified oligo. Step 7: Purify Peptide-Oligo conjugate by ion 
exchange chromatography on Nucleogen DEAE-500-10 or eq. Elution order: free peptide, 
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peptide-oligo, free oligo. 

Method B: Couple Oligo Amine To Peptide Cysteine (oligos > 15mers are excluded from 
desalting column). 

5 The following is an example of one method that can be used to couple an oligo 

amine to a peptide cysteine. Use above procedure switching oligo for peptide. Step 1 : 
Synthesize a peptide with an N-terminal, or internal, cysteine. In this method, Tat is a basic 
protein with numerous reactive lysine side-chains. Step 2: Synthesize an oligonucleotide 
with a 5* amino modifier.Step 3: Purify oligo Trityl-on by RP HPLC or cartridge. Step 4: 

10 React oligo with excess mal-sac-HNSA (pH 7.5 Sodium phosphate). Step 5: Separation of 
oligo-mal-sac conjugate from free crosslinker and buffer exchange (pH 6 Sodium 
phosphate) using a gel filtration column (NAP or eq.). Note oligo must be large enough to 
seperate from the free linker which can be visualized as a yellow band. Do not collect 
yellow band with oligo. Step 6: Dissolve peptide in pH 6.0 Sodium phosphate buffer and 

15 react with activated oligo. Step 7: Purify Peptide-Oligo conjugate by ion exchange 

chromatography on Nucleogen DEAE-500-10 or eq. Elution order: free peptide, peptide- 
oligo, free oligo. 

Throughout this application, various publications are referenced. The disclosures of 
these publications in their entireties are hereby incorporated by reference into this 
20 application in order to more fully describe the state of the art to which this invention 
pertains. 

It will be apparent to those skilled in the art that various modifications and variations 
can be made in the present invention without departing from the scope or spirit of the 
invention. Other embodiments of the invention will be apparent to those skilled in the art 
25 from consideration of the specification and practice of the invention disclosed herein. It is 
intended that the specification and examples be considered as exemplary only, with a true 
scope and spirit of the invention being indicated by the following claims. 
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What is claimed is: 

1 . A context-dependent cytidine deaminase inhibitor comprising a nucleoside 
incorporated in a polymeric substrate, wherein the polymeric substrate targets a 
cytidine deaminase. 

2. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside is susceptible to nucleophilic attack. 

3. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside comprises a sp 2 carbon center susceptible to nucleophilic attack. 

4. The context-dependent cytidine deaminase inhibitor of claim 3, wherein an 
electrophilic group is present at the sp 2 carbon center. 

5. The context-dependent cytidine deaminase inhibitor of claim 3, wherein the sp 2 
carbon center is a carbonyl group, an imino group, or an alkenyl group. 

6. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside is susceptible to nucleophilic attack by a metal-bound hydroxide. 

7. The context-dependent cytidine deaminase inhibitor of claim 6, wherein the metal- 
bound hydroxide is zinc hydroxide. 

8. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside comprises a purine. 

9. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside comprises a pyrimidine. 

10. The context-dependent cytidine deaminase inhibitor of claim 1, wherein when the 
nucleoside comprises a pyrimidine or purine, and wherein nucleophilic attack occurs 
at C4 of the pyrimidine or C6 of the purine. 

11. The context-dependent cytidine deaminase inhibitor of claim 10, wherein the purine 
has an electron-withdrawing group at C4 of the pyrimidine or C6 of the purine. 

12. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside has at least one sp 3 carbon center. 

13. The context-dependent cytidine deaminase inhibitor of claim 12, wherein the sp 3 
carbon center has at least one -OR group, wherein R is hydrogen, alkyl, aryl, 
aralkyl, alkenyl, alkynyl, or acyl. 
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14. The context-dependent cytidine deaminase inhibitor of claim 1, wherein when the 
nucleoside comprises a pyrimidine or purine, the nucleoside comprises a sp 3 carbon 
center at C4 of the pyrimidine or C6 of the purine. 

15. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside comprises at least one group capable of hydrogen bonding. 

16. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside has the formula I or II 



X 




OR 2 R 3 
I 

II 

wherein 



R\R 2 , andR 3 are, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, aralkyl, monophosphate, diphosphate, triphosphate ,a phosphate 
derivative, N 3 , NR 4 R 5 , N0 2 , NOR 6 , CN, -C(0)NH 2 , SH, -S-alkyl, -S-aryl, or 
a residue of the polymeric substrate, wherein at least one of R 2 or R 3 is 
hydroxyl; 

X and Y are, independently, hydrogen, hydroxyl, alkoxy, alkyl, acyl, aryl, 
aralkyl, NR 4 R 5 , or an electron- withdrawing group; 



wherein R 4 , R 5 , and R 6 are, independently, alkyl, aryl, aralkyl, alkaryl, acyl, 
or hydrogen 
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Z is nitrogen or CR 7 , wherein R 7 is hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, or aralkyl. 

17. The context-dependent cytidine deaminase inhibitor of claim 16, wherein X is an 
electron-withdrawing group and wherein the electron-withdrawing group comprises 
a nitro group, a halo group, a cyano group, an ester group, an aldehyde group, a keto 
group, a sulfone group, or an amide group. 

18. The context-dependent cytidine deaminase inhibitor of claim 16, wherein when the 
nucleoside is formula I, R 2 is hydroxyl and R 3 is hydroxyl, halo, or alkoxy. 

19. The context-dependent cytidine deaminase inhibitor of claim 18, wherein R 3 is 
hydroxyl. 

20. The context-dependent cytidine deaminase inhibitor of claim 19, wherein X is halo. 

21. The context-dependent cytidine deaminase inhibitor of claim 20, wherein the halo is 
fluoro, chloro, bromo, or iodo. 

22. The context-dependent cytidine deaminase inhibitor of claim 21, wherein Z is CH. 

23. The context-dependent cytidine deaminase inhibitor of claim 16, wherein when the 
nucleoside is formula II, R 2 is hydroxyl and R 3 is hydroxyl, halo, or alkoxy. 

24. The context-dependent cytidine deaminase inhibitor of claim 23, wherein X is 
hydrogen, hydroxyl, or NR 4 R 5 , and Y is hydrogen. 

25. The context-dependent cytidine deaminase inhibitor of claim 23, wherein Z is 
nitrogen. 

26. The context-dependent cytidine deaminase inhibitor of claim 23, wherein Z is CR 7 , 
wherein R 7 is hydrogen or hydroxyl. 

27. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside is 4-fluorozebularine. 

28. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside is 2'-deoxy, 2'-fluorozebularine. 

29. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside is 3,4,5,6-tetrahydro-2'-deoxyuridine. 

30. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside is nebularine, 8-azanebularine, or coformycin. 

31. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside is zebularine. 
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32. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
nucleoside has the formula III 



R 1 , R 2 , and R 3 are, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, aralkyl, monophosphate, diphosphate, triphosphate, a phosphate 
derivative (such as a phosphorothioate, phosphoramidate, phosphonate, or a 



2'-5' phosphodiester linkage), N 3 , NR 4 R 5 , N0 2 , NOR 6 , CN, -C(0)NH 2 , SH, - 



wherein R 4 , R 5 , R 6 , R 8 , and R 9 are, independently, alkyl, aryl, aralkyl, 
alkaryl, acyl, or hydrogen; 

X and Y (of Structure II above) are, independently, hydrogen, hydroxyl, 
alkoxy, alkyl, acyl, aryl, aralkyl, NR 4 R 5 , or an electron-withdrawing group; 

Z is nitrogen or CR 7 , wherein R 7 is hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, or aralkyl. 



33. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the 
polymeric substrate comprises oligonucleotide residues. 




Ill 



wherein 



S-alkyl, -S-aryl, wherein at least one of R or R is hydroxyl; 
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34. The context-dependent cytidine deaminase inhibitor of claim 33, wherein the 
oligonucleotide residues comprise up to about 1 8 residues. 

35. The context-dependent cytidine deaminase inhibitor of claim 33, wherein the 
oligonucleotide residues comprise RNA residues. 

36. The context-dependent cytidine deaminase inhibitor of claim 35, wherein the RNA 
residues form single stranded RNA. 

37. The context-dependent cytidine deaminase inhibitor of claim 35, wherein the RNA 
residues form double stranded RNA. 

38. The context-dependent cytidine deaminase inhibitor of claim 37, wherein the double 
stranded RNA comprises a single stranded replication-like bubble. 

39. The context-dependent cytidine deaminase inhibitor of claim 35, wherein the 
nucleoside comprises zebularine. 

40. The context-dependent cytidine deaminase inhibitor of claim 33, wherein the 
oligonucleotide residues comprise DNA residues. 

41. The context-dependent cytidine deaminase inhibitor of claim 40, wherein the DNA 
residues form a single stranded DNA. 

42. The context-dependent cytidine deaminase inhibitor of claim 40 wherein the DNA 
residues form a double stranded DNA. 

43. The context-dependent cytidine deaminase inhibitor of claim 42, wherein the double 
stranded DNA comprises a single stranded replication bubble. 

44. The context-dependent cytidine deaminase inhibitor of claim 33, wherein the 
oligonucleotide residues comprise single stranded residues of both DNA and RNA. 

45. The context-dependent cytidine deaminase inhibitor of claim 33, wherein the 
oligonucleotide residues comprise double stranded residues of both DNA and RNA. 

46. The context-dependent cytidine deaminase inhibitor of claim 33, wherein the double 
stranded residues comprise RNA residues on one strand and DNA residues on 
another strand. 

47. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the targeted 
cytidine deaminase is APOBEC-1. 

48. The context-dependent cytidine deaminase inhibitor of claim 47 comprising SEQ 
IDNO:l. 

49. The context-dependent cytidine deaminase inhibitor of claim 1, wherein the targeted 
cytidine deaminase is an APOBEC-related protein. 
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50. The context-dependent cytidine deaminase inhibitor of claim 49, wherein the 
APOBEC-related protein is AID. 

51. The context-dependent cytidine deaminase inhibitor of claim 50, comprising 
WRZY, wherein W is A or T, wherein R is a purine, wherein Z is the nucleoside, 
and wherein Y is a pyrimidine. 

52. The context-dependent cytidine deaminase inhibitor of claim 49, wherein the 
APOBEC-related protein is CEM15. 

53. The context-dependent cytidine deaminase inhibitor of claim 52, comprising SEQ 
IDNO:2. 

54. A context-dependent cytidine deaminase inhibitor comprising a nucleoside 
incorporated in a polymeric substrate, wherein the polymeric substrate targets a 
cytidine deaminase auxiliary protein. 

55. The context-dependent cytidine deaminase inhibitor of claim 54, wherein the 
targeted cytidine deaminase auxiliary protein is APOBEC-1 Complementing Factor. 

56. The context-dependent cytidine deaminase inhibitor of claim 55 comprising SEQ ED 
NO:3. 

57. The context-dependent cytidine deaminase inhibitor of claim 54, wherein the 
targeted cytidine deaminase auxiliary protein is APOBEC-1 Stimulating Protein 
(ASP). 

58. The context-dependent cytidine deaminase inhibitor of claim 57 comprising SEQ ID 
NO:4. 

59. A nucleoside having the formula I 
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X 




I 

wherein 

R 1 , R 2 , and R 3 are, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, aralkyl, monophosphate, diphosphate, triphosphate ,a phosphate 
derivative, N 3 , NR 4 R 5 , N0 2 , NOR 6 , CN, -C(0)NH 2 , SH, -S-alkyl, -S-aryl, 
wherein at least one of R 2 or R 3 is hydroxyl; 

wherein R 4 , R 5 , and R 6 are, independently, alkyl, aryl, aralkyl, alkaryl, acyl, 
or hydrogen; 

X is an electron-withdrawing group, and 

Z is nitrogen or CR 7 , wherein R 7 is hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, or aralkyl, 

wherein the nucleoside is not uracil, thymine, cytosine, zebularine. 

60. The nucleoside of claim 59, wherein X comprises a nitro group, a halo group, a 
cyano group, an ester group, an aldehyde group, a keto group, a sulfone group, or an 
amide group. 

61. The nucleoside of claim 59, R 1 and R 2 are hydroxyl, and R 3 is hydroxyl, halo, or 
alkoxy. 
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62. The nucleoside of claim 61, wherein R 3 is hydroxyl. 

63. The nucleoside of claim 62, wherein X is halo. 

64. The nucleoside of claim 63, wherein the halo is fluoro, chloro, bromo, or iodo. 

65. The nucleoside of claim 64, wherein Z is CH. 

66. The nucleoside of claim 59, wherein the nucleoside is 4-fluorozebularine. 

67. The nucleoside of claim 59, wherein the nucleoside is 2'-deoxy, 2'-fluorozebularine. 

68. A nucleoside having the formula III 



R 1 , R 2 , and R 3 are, independently, hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, aralkyl, monophosphate, diphosphate, triphosphate ,a phosphate 
derivative, N 3 , NR 4 R 5 , N0 2 , NOR 6 , CN, -C(0)NH 2 , SH, -S-alkyl, -S-aryl, 
wherein at least one of R 2 or R 3 is hydroxyl; 

wherein R 4 , R 5 , R 6 , R 8 , and R 9 are, independently, alkyl, aryl, aralkyl, 
alkaryl, acyl, or hydrogen; 

X is an electron-withdrawing group, and 

Z is nitrogen or CR 7 , wherein R 7 is hydrogen, hydroxyl, alkoxy, halo, alkyl, 
acyl, aryl, or aralkyl. 




Ill 



wherein 
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69. A polymeric substrate having at least one nucleoside of claims 59-68 incorporated 
within the substrate. 

70. A method of inhibiting a cytidine deaminase comprising the steps of contacting a 
cell containing the cytidine deaminase with the context-dependent cytidine 
deaminase inhibitor of claim 1 under conditions that allow the inhibitor to inhibit the 
deaminase function of the cytidine deaminase. 

71. The method of claim 70, wherein the contacting step is in vivo. 

72. The method of claim 70, wherein the contacting step is in vitro. 

73. A method of inhibiting a cytidine deaminase comprising the steps of contacting a 
cell containing the cytidine deaminase with the context-dependent cytidine 
deaminase inhibitor of claim 54 under conditions that allow the inhibitor to inhibit 
the deaminase function of the cytidine deaminase. 

74. The method of claim 73, wherein the contacting step is in vivo. 

75. The method of claim 73, wherein the contacting step is in vitro. 

76. A method of treating a subject with a disorder associated with a cytidine deaminase 
comprising administering to the subject a therapeutic amount of the context- 
dependent cytidine deaminase inhibitor of claim 1 . 

77. The method of claim 76, wherein the disorder is associated with AID. 

78. The method of claim 77, wherein the disorder is a B cell malignancy. 

79. The method of claim 78, wherein the B cell malignancy is a diffuse large B cell 
lymphoma. 

80. The method of claim 78, wherein the B cell malignancy is a chronic lymphatic 
leukemia. 

81. The method of claim 77 wherein the disorder is a T cell lymphoma. 

82. The method of claim 77, wherein the disorder is a pulmonary adenoma. 

83. The method of claim 76, wherein the disorder is associated with APOBEC-1. 

84. The method of claim 83, wherein the disorder is cancer. 

85. The method of claim 84, wherein the cancer is colorectal cancer. 

86. The method of claim 83, wherein the disorder is neurofibromatosis. 

87. The method of claim 76, wherein the disorder is associated with CEM15. 

88. The method of claim 87, wherein the disorder is cancer. 

89. The method of claim 88, wherein the cancer is breast cancer. 
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90. A composition comprising the context-dependent cytidine deaminase inhibitor of 
claim 1 and a pharmaceutical carrier. 

91. A cell comprising the context-dependent cytidine deaminase inhibitor of claim 1 . 

92. A method of promoting retroviral incorporation in a subject comprising the steps of 
administering to the subject a therapeutic amount of the context-dependent cytidine 
deaminase inhibitor of claim 1 . 

93. A method of promoting incorporation of a viral vector in a subject comprising the 
steps of administering to the subject a therapeutic amount of the context-dependent 
cytidine deaminase inhibitor of claim 1 and administering the viral vector to the 
subject. 

94. A method of treating a subject with a disorder associated with a cytidine deaminase 
comprising administering to the subject a therapeutic amount of the context- 
dependent cytidine deaminase inhibitor of claim 54. 

95. The method of claim 94, wherein the disorder is associated with AID. 

96. The method of claim 95, wherein the disorder is a B cell malignancy. 

97. The method of claim 96, wherein the B cell malignancy is a diffuse large B cell 
lymphoma. 

98. The method of claim 96, wherein the B cell malignancy is a chronic lymphatic 
leukemia. 

99. The method of claim 94 wherein the disorder is a T cell lymphoma. 

100. The method of claim 94, wherein the disorder is a pulmonary adenoma. 

101. The method of claim 94, wherein the disorder is associated with APOBEC-1 . 

102. The method of claim 101, wherein the disorder is cancer. 

103. The method of claim 102, wherein the cancer is colorectal cancer. 

104. The method of claim 101, wherein the disorder is neurofibromatosis. 

105. The method of claim 94, wherein the disorder is metabolic syndrome. 

106. The method of claim 94, wherein the disorder is cardiovascular disease. 

107. The method of claim 94, wherein the disorder is fatty liver disease. 

108. The method of claim 94, wherein the disorder is associated with CEM15. 

109. The method of claim 108, wherein the disorder is cancer. 

1 10. The method of claim 109, wherein the cancer is breast cancer. 

111. A composition comprising the context-dependent cytidine deaminase inhibitor of 
claim 54 and a pharmaceutical carrier. 
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112. A method of promoting incorporation of a viral vector in a subject comprising 
the steps of comprising administering to the subject a therapeutic amount of the 
context-dependent cytidine deaminase inhibitor of claim 54 and administering the 
viral vector to the subject. 

113. A cell comprising the context-dependent cytidine deaminase inhibitor of claim 
54. 

1 14. A method of screening for a context-dependent cytidine deaminase inhibitor 
comprising the steps of 

(a) contacting a cytidine deaminase with an agent to be screened, wherein the 
agent is incorporated in a polymeric substrate, wherein the polymeric 
substrate targets the cytidine deaminase, and 

(b) determining the level of cytidine deaminase activity, 

a reduction in cytidine deaminase activity, as compared to cytidine deaminase 
activity in the absence of the agent to be screened, indicating a context-dependent 
cytidine deaminase inhibitor. 

115. The method of claim 114, wherein the cytidine deaminase is in a cell. 

116. The method of claim 114, wherein the polymeric substrate comprises 
oligonucleotide residues. 

1 17. The method of claim 116, wherein the oligonucleotide residues comprise up to 
about 1 8 residues. 

118. The method of claim 116, wherein the oligonucleotide residues comprise RNA 
residues. 

119. The method of claim 118, wherein the RNA residues form a single stranded 
RNA. 

120. The method of claim 118, wherein the RNA residues form a double stranded 
RNA. 

121 . The method of claim 120, wherein the double stranded RNA comprises a single 
stranded replication bubble. 

122. The method of claim 116, wherein the oligonucleotide residues comprise DNA 
residues. 

123. The method of claim 122, wherein the DNA residues form a single stranded 
DNA. 
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124. The method of claim 122, wherein the DNA residues form a double stranded 
DNA. 

125. The method of claim 124, wherein the double stranded DNA comprises a single 
stranded replication bubble. 

126. The method of claim 116, wherein the oligonucleotide residues comprise single 
stranded residues of both DNA and RNA. 

127. The method of claim 116, wherein the oligonucleotide residues double stranded 
residues of both DNA and RNA. 

128. The method of claim 127, wherein the double stranded residues comprise RNA 
residues on one strand and DNA residues on another strand. 
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Figure 1: The Cytidine Deaminase (CDA) Reaction 
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Figure 2: 
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Figure 3: The Editosome Model for C-to-U deamination 
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Figure 4: Zebularine Mimics for CDA Inhibition 
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Figure 5: Context Dependent Inhibition 
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* Flanking RNA or DNA could be made more stable [resistant 
to chemical or enzymatic (nuclease) degradation] by use of 
phosphorothioate strands, 2*-OMe, 2'-F, or 2'-NH3 substitution 
incorporated at the level of chemical synthesis. 
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Figure 10 



6/9 



WO 2005/115410 



PCT/US2005/016001 




Figure 1 1 
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SEQUENCE LISTING 

<110> University of Rochester 

Joseph E. WEDEKIND 
Harold C. SMITH 

<12 0> CONTEXT DEPENDENT INHIBITORS OF CYTIDINE 
DEAMINASES AND USES THEREOF 



<130> 21108. 0039P1 

<150> 60/586,490 
<151> 2004-06-04 

<160> 22 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 236 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence ; note = 
synthetic construct 



<400> 1 



Met 


Thr 


Ser 


Glu 


Lys 


Gly 


Pro 


Ser 


Thr 


Gly 


Asp 


Pro 


Thr 


Leu 


Arg 


Arg 


1 








5 










10 










15 




Arg 


He 


Glu 


Pro 


Trp 


Glu 


Phe 


Asp 


Val 


Phe 


Tyr 


Asp 


Pro 


Arg 


Glu 


Leu 








20 










25 










30 






Arg 


Lys 


Glu 


Ala 


Cys 


Leu 


Leu 


Tyr 


Glu 


He 


Lys 


Trp 


Gly 


Met 


Ser 


Arg 






35 










40 










45 








Lys 


He 


Trp 


Arg 


Ser 


Ser 


Gly 


Lys 


Asn 


Thr 


Thr 


Asn 


His 


Val 


Glu 


Val 




50 










55 










60 










Asn 


Phe 


He 


Lys 


Lys 


Phe 


Thr 


Ser 


Glu 


Arg 


Asp 


Phe 


His 


Pro 


Ser 


He 


65 










70 










75 










80 


Ser 


Cys 


Ser 


He 


Thr 


Trp 


Phe 


Leu 


Ser 


Trp 


Ser 


Pro 


Cys 


Trp 


Glu 


Cys 










85 










90 










95 




Ser 


Gin 


Ala 


He 


Arg 


Glu 


Phe 


Leu 


Ser 


Arg 


His 


Pro 


Gly 


Val 


Thr 


Leu 








100 










105 










110 






Val 


He 


Tyr 


Val 


Ala 


Arg 


Leu 


Phe 


Trp. 


His 


Met 


Asp 


Gin 


Gin 


Asn 


Arg 






115 










120 










125 








Gin 


Gly 


Leu 


Arg 


Asp 


Leu 


Val 


Asn 


Ser 


Gly 


Val 


Thr 


He 


Gin 


He 


Met 




130 










135 










140 










Arg 


Ala 


Ser 


Glu 


Tyr 


Tyr 


His 


Cys 


Trp 


Arg 


Asn 


Phe 


Val 


Asn 


Tyr 


Pro 


145 










150 










155 










160 


Pro 


Gly 


Asp 


Glu 


Ala 


His 


Trp 


Pro 


Gin 


Tyr 


Pro 


Pro 


Leu 


Trp 


Met 


Met 










165 










170 










175 




Leu 


Tyr 


Ala 


Leu 


Glu 


Leu 


His 


Cys 


He 


He 


Leu 


Ser 


Leu 


Pro 


Pro 


Cys 






180 










185 










190 






Leu 


Lys 


He 


Ser 


Arg 


Arg 


Trp 


Gin 


Asn 


His 


Leu 


Thr 


Phe 


Phe 


Arg 


Leu 






195 










200 










205 








His 


Leu 


Gin 


Asn 


Cys 


His 


Tyr 


Gin 


Thr 


He 


Pro 


Pro 


His 


He 


Leu 


Leu 




210 










215 










220 










Ala 


Thr 


Gly 


Leu 


He 


His 


Pro 


Ser 


Val 


Ala 


Trp 


Arg 











225 230 235 



1 



WO 2005/115410 



PCT/US2005/016001 



<210> 2 

<211> 863 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 2 

gatcccagag gaggaagtcc agagacagag caccatgact tctgagaaag gagaagaatc 60 

gaaccctggg agtttgacgt cttctatgac cccagagaac ttcgtaaaga ggcctgtctg 120 

ctctacgaaa tcaagtgggg catgagccgg aagatctggc gaagctcagg caaaaacacc 180 

accaatcacg tggaagttaa ttttataaaa aaatttacgt cagaaagaga ttttcaccca 240 

tccatcagct gctccatcac ctggttcttg tcctggagtc cctgctggga atgctcccag 300 

gctattagag agtttctgag tcggcaccct ggtgtgactc tagtgatcta cgtagctcgg 360 

cttttttggc acatggatca acaaaatcgg caaggtctca gggaccttgt taacagtgga 420 

gtaactattc agattatgag agcatcagag tattatcact gctggaggaa ttttgtcaac 480 

tacccacctg gggatgaagc tcactggcca caatacccac ctctgtggat gatgttgtac 540 

gcactggagc tgcactgcat aattctaagt cttccaccct gtttaaagat ttcaagaaga 600 

tggcaaaatc atcttacatt tttcagactt catcttcaaa actgccatta ccaaacgatt 660 

ccgccacaca tccttttagc tacagggctg atacatcctt ctgtggcttg gagatgaata 720 

ggatgattcc gtgtgtgtac tgattcaaga acaagcaatg atgacccact aaagagtgaa 780 

tgccatttag aatctagaaa tgttcacaag gtaccccaaa actctgtagc ttaaaccaac 840 

aataaatatg tattacctct ggc 863 



<210> 3 
<211> 424 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct ' 



<400> 3 






























Met 


Gly 


Pro 


Phe 


Cys 


Leu 


Gly 


Cys 


Ser 


His 


Arg 


Lys 


Cys 


Tyr 


Ser 


Pro 


1 








5 










10 










15 




He 


Arg 


Asn 


Leu 
20 


He 


Ser 


Gin 


Glu 


Thr 
25 


Phe 


Lys 


Phe 


His 


Phe 
30 


Lys 


Asn 


Leu Gly 


Tyr 


Ala 


Lys 


Gly 


Arg 


Lys 


Asp 


Thr 


Phe 


Leu 


Cys 


Tyr 


Glu 


Val 






35 










40 










45 








Thr 


Arg 


Lys 


Asp 


Cys 


Asp 


Ser 


Pro 


Val 


Ser 


Leu 


His 


His 


Gly Val 


Phe 




50 










55 










60 










Lys 


Asn 


Lys 


Asp 


Asn 


He 


His 


Ala 


Glu 


He 


Cys 


Phe 


Leu 


Tyr 


Trp 


Phe 


65 










70 










75 










80 


His 


Asp 


Lys 


Val 


Leu 
85 


Lys 


Val 


Leu 


Ser 


Pro 
90 


Arg 


Glu 


Glu 


Phe 


Lys 
95 


He 


Thr 


Trp 


Tyr 


Met 
100 


Ser 


Trp 


Ser 


Pro 


Cys 
105 


Phe 


Glu 


Cys 


Ala 


Glu 
110 


Gin 


He 


Val 


Arg 


Phe 


Leu 


Ala 


Thr 


His 


His 


Asn 


Leu 


Ser 


Leu 


Asp 


He 


Phe 


Ser 




115 










120 










125 








Ser Arg 


Leu 


Tyr 


Asn 


Val 


Gin 


Asp 


Pro 


Glu 


Thr 


Gin 


Gin 


Asn 


Leu 


Cys 




130 










135 










140 










Arg 


Leu 


Val 


Gin 


Glu 


Gly 


Ala 


Gin 


Val 


Ala 


Ala 


Met 


Asp 


Leu 


Tyr 


Glu 


145 










150 










155 










160 


Phe 


Lys 


Lys 


Cys 


Trp 
165 


Lys 


Lys 


Phe 


Val 


Asp 
170 


Gly 


Gly 


Arg 


Arg 


Phe 
175 


Arg 


Pro 


Trp 


Lys 


Arg 
180 


Leu 


Leu 


Thr 


Asn 


Phe 
185 


Arg 


Tyr 


Gin 


Asp 


Ser 
190 


Lys 


Leu 



2 



WO 2005/115410 



PCT/US2005/016001 



Gin 


Glu 


He 
195 


Leu 


Arg 


Pro 


Cys 


Tyr 
200 


He 


Pro 


Val 


Pro 


Ser 
205 


Ser 


Ser 


Ser 


Ser 


Thr 
210 


Leu 


Ser 


Asn 


He 


Cys 
215 


Leu 


Thr 


Lys 


Gly 


Leu 
220 


Pro 


Glu 


Thr 


Arg 


Phe 


Cys 


Val 


Glu 


Gly 


Arg 


Met 


Asp 


Pro 


Leu 


Ser 


Glu 


Glu 


Glu 


Phe 


Tyr 


225 










230 










235 










240 


Ser 


Gin 


Phe 


Tyr 


Asn 
245 


Gin 


Arg 


Val 


Lys 


His 
250 


Leu 


Cys 


Tyr 


Tyr 


His 
255 


Arg 


Met 


Lys 


Pro 


Tyr 
260 


Leu 


Cys 


Tyr 


Gin 


Leu 
265 


Glu 


Gin 


Phe 


Asn 


Gly 
270 


Gin 


Ala 


Pro 


Leu 


Lys 
275 


Gly 


Cys 


Leu 


Leu 


Ser 
280 


Glu 


Lys 


Gly 


Lys 


Gin 
285 


His 


Ala 


Glu 


Leu 


Phe 
290 


Leu 


Asp 


Lys 


He 


Arg 
295 


Ser 


Met 


Glu 


Leu 


Ser 
300 


Gin 


Val 


Thr 


He 


Thr 


Cys 


Tyr 


Leu 


Thr 


Trp 


Ser 


Pro 


Cys 


Pro 


Asn 


Cys 


Ala 


Trp 


Gin 


Leu 


305 










310 










315 










320 


Ala 


Ala 


Phe 


Lys 


Arg 
325 


Asp 


Arg 


Pro 


Asp 


Leu 
330 


He 


Leu 


His 


He 


Tyr 
335 


Thr 


Ser 


Arg 


Leu 


Tyr 
340 


Phe 


His 


Trp 


Lys 


Arg 
345 


Pro 


Phe 


Lys 


Gly 


Leu 
350 


Cys 


Ser 


Leu 


Trp 


Gin 
355 


Ser 


Gly 


He 


Leu 


Val 
360 


Asp 


Val 


Met 


Asp 


Leu 
365 


Pro 


Gin 


Phe 


Thr 


Asp 
370 


Cys 


Trp 


Thr 


Asn 


Phe 
375 


Val 


Asn 


Pro 


Lys 


Arg 
380 


Pro 


Phe 


Trp 


Pro 


Trp 


Lys 


Gly 


Leu 


Glu 


He 


He 


Ser 


Arg 


Arg 


Thr 


Gin 


Arg Arg 


Leu 


Arg 


385 










390 










395 










400 


Arg 


He 


Lys 


Glu 


Ser 
405 


Trp 


Leu 


Gin 


Asp 


Leu 
410 


Val 


Asn 


Asp 


Phe 


Gly 
415 


Asn 


Leu 


Gin 


Leu 


Gly 
420 


Pro 


Pro 


Met 


Ser 



















<210> 4 
<211> 9229 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 4 

ggtctctctg gttagaccag atttgagcct gggagctctc tggctaacta gggaacccac 60 

tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 120 

gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca 180 

gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag 240 

gactcggctt gctgaagcgc gcacggcaag aggcgagggg aggcgactgg tgagtacgcc 3 00 

aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 3 60 

gcgggggaga attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 42 0 

ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg 480 

gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc 540 

agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc 600 

atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa 660 

acaaaagtaa gaaaaaagca cagcaagcag cagctgacac aggacacagc agccaggtca 72 0 

gccaaaatta ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac 780 

ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga 840 

tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa 900 

acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag 960 

ctgcagaatg ggatagagtg catccagtgc atgcagggcc tattgcacca ggccagatga 102 0 

gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat 1080 

ggatgacaaa taatccacct atcccagtag gagaaattta taaaagatgg ataatcctgg 114 0 

gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac 12 00 
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caaaagaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1260 

cttcacagga ggtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcgaacccag 1320 

attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag 1380 

catgtcaggg agtgggagga cccggccata aggcaagagt tttggctgaa gcaatgagcc 144 0 

aagtaacaaa ttcagctacc ataatgatgc aaagaggcaa ttttaggaac caaagaaaga 1500 

ttgttaagtg tttcaattgt ggcaaagaag ggcacatagc cagaaattgc agggccccta 1560 

ggaaaaaggg ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga 162 0 

gacaggctaa ttttttaggg aagatctggc cttcctacaa gggaaggcca gggaattttc 1680 

ttcagagcag accagagcca acagccccac catttcttca gagcagacca gagccaacag 1740 

ccccaccaga agagagcttc aggtctgggg tagagacaac aactccctct cagaagcagg 1800 

agccgataga caaggaactg tatcctttaa cttccctcag atcactcttt ggcaacgacc 1860 

cctcgtcaca ataaagatag gggggcaact aaaggaagct ctattagata caggagcaga 192 0 

tgatacagta ttagaagaaa tgagtttgcc aggaagatgg aaaccaaaaa tgataggggg 1980 

aattggaggt tttatcaaag taagacagta tgatcagata ctcatagaaa tctgtggaca 2 040 

taaagctata ggtacagtat tagtaggacc tacacctgtc aacataattg gaagaaatct 2100 

gttgactcag attggttgca ctttaaattt tcccattagt cctattgaaa ctgtaccagt 2160 

aaaattaaag ccaggaatgg atggcccaaa agttaaacaa tggccattga cagaagaaaa 222 0 

aataaaagca ttagtagaaa tttgtacaga aatggaaaag gaagggaaaa tttcaaaaat 2280 

tgggcctgaa aatccataca atactccagt atttgccata aagaaaaaag acagtactaa 2340 

atggagaaaa ttagtagatt tcagagaact taataagaga actcaagact tctgggaagt 2400 

tcaattagga ataccacatc ccgcagggtt aaaaaagaaa aaatcagtaa cagtactgga 2460 

tgtgggtgat gcatattttt cagttccctt agatgaagac ttcaggaagt atactgcatt 2520 

taccatacct agtataaaca atgagacacc agggattaga tatcagtaca atgtgcttcc 2580 

acagggatgg aaaggatcac cagcaatatt ccaaagtagc atgacaaaaa tcttagagcc 2 64 0 

ttttagaaaa caaaatccag acatagttat ctatcaatac atggatgatt tgtatgtagg 2700 

atctgactta gaaatagggc agcatagaac aaaaatagag gagctgagac aacatctgtt 2 760 

gaggtgggga cttaccacac cagacaaaaa acatcagaaa gaacctccat tcctttggat 2 82 0 

gggttatgaa ctccatcctg ataaatggac agtacagcct atagtgctgc cagaaaaaga 2 880 

cagctggact gtcaatgaca tacagaagtt agtgggaaaa ttgaattggg caagtcagat 2 940 

ttacccaggg attaaagtaa ggcaattatg taaactcctt agaggaacca aagcactaac 3 000 

agaagtaata ccactaacag aagaagcaga gctagaactg gcagaaaaca gagagattct 3 060 

aaaagaacca gtacatggag tgtattatga cccatcaaaa gacttaatag cagaaataca 3120 

gaagcagggg caaggccaat ggacatatca aatttatcaa gagccattta aaaatctgaa 3180 

aacaggaaaa tatgcaagaa cgaggggtgc ccacactaat gatgtaaaac aattaacaga 3240 

ggcagtgcaa aaaataacca cagaaagcat agtaatatgg ggaaagactc ctaaatttaa 3300 

actacccata caaaaggaaa catgggaaac atggtggaca gagtattggc aagccacctg 3360 

gattcctgag tgggagtttg tcaatacccc tcctttagtg aaattatggt accagttaga 3420 

gaaagaaccc atagtaggag cagaaacgtt ctatgtagat ggggcagcta gcagggagac 3480 

taaattagga aaagcaggat atgttactaa tagaggaaga caaaaagttg tcaccctaac 3540 

tgacacaaca aatcagaaga ctgagttaca agcaattcat ctagctttgc aggattcggg 3600 

attagaagta aatatagtaa cagactcaca atatgcatta ggaatcattc aagcacaacc 3 660 

agataaaagt gaatcagagt tagtcaatca aataatagag cagttaataa aaaaggaaaa 3 72 0 

ggtctatctg gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa 3780 

attagtcagt gctggaatca ggaaagtact atttttagat ggaatagata aggcccaaga 3840 

tgaacatgag aaatatcaca gtaattggag agcaatggct agtgatttta acctgccacc 3900 

tgtagtagca aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat 3 960 

gcatggacaa gtagactgta gtccaggaat atggcaacta gattgtacac atttagaagg 4020 

aaaagttatc ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttattcc 4080 

agcagaaaca gggcaggaaa cagcatactt tcttttaaaa ttagcaggaa gatggccagt 4140 

aaaaacaata catacagaca atggcagcaa tttcaccagt actacggtta aggccgcctg 4200 

ttggtgggcg ggaatcaagc aggaatttgg aattccctac aatccccaaa gtcaaggagt 4260 

agtagaatct atgaataaag aattaaagaa aattataggc caggtaagag atcaggctga 4320 

acatcttaag acagcagtac aaatggcagt attcatccac aattttaaaa gaaaaggggg 4380 

gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac 444 0 

taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag 4500 

cagagatcca ctttggaaag gaccagcaaa gctcctctgg aaaggtgaag gggcagtagt 4560 

aatacaagat aatagtgaca taaaagtagt gccaagaaga aaagcaaaga tcattaggga 4 62 0 

ttatggaaaa cagatggcag gtgatgattg tgtggcaagt agacaggatg aggattagaa 4680 

catggaaaag tttagtaaaa caccatatgt atgtttcagg gaaagctagg ggatggtttt 4740 

atagacatca ctatgaaagc cctcatccaa gaataagttc agaagtacac atcccactag 4800 

gggatgctag attggtaata acaacatatt ggggtctgca tacaggagaa agagactggc 4 860 
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atctgggtca gggagtctcc atagaatgga ggaaaaagag atatagcaca caagtagacc 4 92 0 

ctgaactagc agaccaacta attcatctgt attactttga ctgtttttca gactctgcta 4980 

taagaaaggc cttattagga catatagtta gccctaggtg tgaatatcaa gcaggacata 5040 

acaaggtagg atctctacaa tacttggcac tagcagcatt aataacacca aaaaagataa 5100 

agccaccttt gcctagtgtt acgaaactga cagaggatag atggaacaag ccccagaaga 5160 

ccaagggcca cagagggagc cacacaatga atggacacta gagcttttag aggagcttaa 522 0 

gaatgaagct gttagacatt ttcctaggat ttggctccat ggcttagggc aacatatcta 5280 

tgaaacttat ggggatactt gggcaggagt ggaagccata ataagaattc tgcaacaact 534 0 

gctgtttatc catttcagaa ttgggtgtcg acatagcaga ataggcgtta ctcaacagag 5400 

gagagcaaga aatggagcca gtagatccta gactagagcc ctggaagcat ccaggaagtc 5460 

agcctaaaac tgcttgtacc acttgctatt gtaaaaagtg ttgctttcat tgccaagttt 5520 

gtttcacaac aaaagcctta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa 5580 

gacctcctca aggcagtcag actcatcaag tttctctatc aaagcagtaa gtagtacatg 5640 

taatgcaacc tatacaaata gcaatagcag cattagtagt agcaataata atagcaatag 5700 

ttgtgtggtc catagtaatc atagaatata ggaaaatatt aagacaaaga aaaatagaca 5760 

ggttaattga tagactaata gaaagagcag aagacagtgg caatgagagt gaaggagaaa 5 82 0 

tatcagcact tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg 5880 

atctgtagtg ctacagaaaa attgtgggtc acagtctatt atggggtacc tgtgtggaag 5 94 0 

gaagcaacca ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat 6000 

aatgtttggg ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg 6060 

gtaaatgtga cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag 612 0 

gatataatca gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt 6180 

gttagtttaa agtgcactga tttggggaat gctactaata ccaatagtag taataccaat 6240 

agtagtagcg gggaaatgat gatggagaaa ggagagataa aaaactgctc tttcaatatc 6300 

agcacaagca taagaggtaa ggtgcagaaa gaatatgcat ttttttataa acttgatata 6360 

ataccaatag ataatgatac taccagctat acgttgacaa gttgtaacac ctcagtcatt 6420 

acacaggcct gtccaaaggt atcctttgag ccaattccca tacattattg tgccccggct 6480 

ggttttgcga ttctaaaatg taataataag acgttcaatg gaacaggacc atgtacaaat 654 0 

gtcagcacag tacaatgtac acatggaatt aggccagtag tatcaactca actgctgttg 6600 

aatggcagtc tagcagaaga agaggtagta attagatctg ccaatttcac agacaatgct 6660 

aaaaccataa tagtacagct gaaccaatct gtagaaatta attgtacaag acccaacaac 6720 

aatacaagaa aaagtatccg tatccagagg ggaccaggga gagcatttgt tacaatagga 6780 

aaaataggaa atatgagaca agcacattgt aacattagta gagcaaaatg gaatgccact 6840 

ttaaaacaga tagctagcaa attaagagaa caatttggaa ataataaaac aataatcttt 6900 

aagcaatcct caggagggga cccagaaatt gtaacgcaca gttttaattg tggaggggaa 6 96 0 

tttttctact gtaattcaac acaactgttt aatagtactt ggtttaatag tacttggagt 7020 

actgaagggt caaataacac tgaaggaagt gacacaatca cactcccatg cagaataaaa 7080 

caatttataa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagcgga 7140 

caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtaataac 7200 

aacaatgggt ccgagatctt cagacctgga ggaggagata tgagggacaa ttggagaagt 7260 

gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 7320 

aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg 73 80 

ttcttgggag cagcaggaag cactatgggc gcacggtcaa tgacgctgac ggtacaggcc 7440 

agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg 7500 

caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg 7560 

gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa 7620 

ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag 7680 

atttggaata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta 774 0 

atacattcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg 7800 

gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat 7860 

ataaaaatat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta 7 92 0 

ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc 7980 

ccaaccccga ggggacccga caggcccgaa ggaatagaag aagaaggtgg agagagagac 8 04 0 

agagacagat ccattcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg 8100 

agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt 8160 

gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacag 8220 

tattggagtc aggaactaaa gaatagtgct gttagcttgc tcaatgccac agccatagca 8280 

gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc 8340 

cacataccta gaagaataag acagggcttg gaaaggattt tgctataaga tgggtggcaa 8400 

gtggtcaaaa agtagtgtgg ttggatggcc tactgtaagg gaaagaatga gacgagctga 8460 

gccagcagca gatggggtgg gagcagcatc tcgagacctg gaaaaacatg gagcaatcac 852 0 
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aagtagcaat acagcagcta ccaatgctgc ttgtgcctgg ctagaagcac aagaggagga 8580 

<39aggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt acaaggcagc 864 0 

tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca 8700 

acgaagacaa gatatccttg atctgtggat ctaccacaca caaggctact tccctgattg 8760 

gcagaactac acaccagggc caggggtcag atatccactg acctttggat ggtgctacaa 8 82 0 

gctagtacca gttgagccag ataaggtaga agaggccaat aaaggagaga acaccagctt 8880 

gttacaccct gtgagcctgc atggaatgga tgaccctgag agagaagtgt tagagtggag 8940 

gtttgacagc cgcctagcat ttcatcacgt ggcccgagag ctgcatccgg agtacttcaa 9000 

gaactgctga catcgagctt gctacaaggg actttccgct ggggactttc cagggaggcg 9060 

tggcctgggc gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt 912 0 

ttgcctgtac tgggtctctc tggttagacc agatttgagc ctgggagctc tctggctaac 9180 

tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttca 922 9 



<210> 5 
<211> 198 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223:> Description of Artificial 
synthetic construct 



<400> 5 



Met 


Asp 


Ser 


Leu 


Leu 


Met 


Asn 


Arg 


1 








5 








Asn 


Val 


Arg 


Trp 


Ala 


Lys 


Gly 


Arg 








20 










Val 


Lys 


Arg 


Arg 


Asp 


Ser 


Ala 


Thr 






35 










40 


Leu 


Arg 


Asn 


Lys 


Asn 


Gly 


Cys 


His 




50 










55 




lie 


Ser 


Asp 


Trp 


Asp 


Leu 


Asp 


Pro 


65 










70 






Phe 


Thr 


Ser 


Trp 


Ser 


Pro 


Cys 


Tyr 










85 








Phe 


Leu 


Arg 


Gly 


Asn 


Pro 


Asn 


Leu 








100 










Leu 


Tyr 


Phe 


Cys 


Glu 


Asp 


Arg 


Lys 






115 










120 


Leu 


His 


Arg 


Ala 


Gly 


Val 


Gin 


He 




130 










135 




Phe 


Tyr 


Cys 


Trp 


Asn 


Thr 


Phe 


Val 


145 










150 






Ala 


Trp 


Glu 


Gly 


Leu 


His 


Glu 


Asn 










165 








Arg 


Arg 


He 


Leu 


Leu 


Pro 


Leu 


Tyr 








180 










Phe 


Arg 


Thr 


Leu 


Gly 


Leu 







195 

<210> 6 
<211> 597 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthetic construct 



Sequence; note = 



Arg 


Lys 


Phe 


Leu 


Tyr 


Gin 


Phe 


Lys 




10 










15 




Arg 


Glu 


Thr 


Tyr 


Leu 


Cys 


Tyr 


Val 


25 










30 






Ser 


Phe 


Ser 


Leu 


Asp 


Phe 


Gly 


Tyr 










45 








Val 


Glu 


Leu 


Leu 


Phe 


Leu 


Arg 


Tyr 








60 










Gly 


Arg 


Cys 


Tyr 


Arg 


Val 


Thr 


Trp 






75 










80 


Asp 


Cys 


Ala 


Arg 


His 


Val 


Ala 


Asp 




90 










95 




Ser 


Leu 


Arg 


He 


Phe 


Thr 


Ala 


Arg 


105 










110 






Ala 


Glu 


Pro 


Glu 


Gly 


Leu 


Arg 


Arg 










125 








Ala 


He 


Met 


Thr 


Phe 


Lys 


Asp 


Tyr 








140 










Glu 


Asn 


His 


Glu 


Arg 


Thr 


Phe 


Lys 






155 










160 


Ser 


Val 


Arg 


Leu 


Ser 


Arg 


Gin 


Leu 




170 










175 




Glu 


Val 


Asp 


Asp 


Leu 


Arg 


Asp 


Ala 


185 










190 







Sequence; note = 
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<400> 6 

atggacagcc tcttgatgaa ccggaggaag tttctttacc aattcaaaaa tgtccgctgg 60 

gctaagggtc ggcgtgagac ctacctgtgc tacgtagtga agaggcgtga cagtgctaca 12 0 

tccttttcac tggactttgg ttatcttcgc aataagaacg gctgccacgt ggaattgctc 180 

ttcctccgct acatctcgga ctgggaccta gaccctggcc gctgctaccg cgtcacctgg 240 

ttcacctcct ggagcccctg ctacgactgt gcccgacatg tggccgactt tctgcgaggg 3 00 

aaccccaacc tcagtctgag gatcttcacc gcgcgcctct acttctgtga ggaccgcaag 360 

gctgagcccg aggggctgcg gcggctgcac cgcgccgggg tgcaaatagc catcatgacc 42 0 

ttcaaagatt atttttactg ctggaatact tttgtagaaa accatgaaag aactttcaaa 4 80 

gcctgggaag ggctgcatga aaattcagtt cgtctctcca gacagcttcg gcgcatcctt 540 

ttgcccctgt atgaggttga tgacttacga gacgcatttc gtactttggg actttga 597 



<210> 7 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 7 

Trp Arg Glx Tyr 
1 

<210> 8 
<211> 1770 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 8 

ctcaatggaa tcaaatcaca aatccgggga tggattgagc ggcactcaga aggaagcagc 60 

cctccgcgca ctggtccagc gcacaggata tagcttggtc caggaaaatg gacaaagaaa 12 0 

atatggtggc cctccacctg gttgggatgc tgcaccccct gaaaggggct gtgaaatttt 180 

tattggaaaa cttccccgag acctttttga ggatgagctt ataccattat gtgaaaaaat 240 

cggtaaaatt tatgaaatga gaatgatgat ggattttaat ggcaacaata gaggatatgc 300 

atttgtaaca ttttcaaata aagtggaagc caagaatgca atcaagcaac ttaataatta 360 

tgaaattaga aatgggcgcc tcttaggggt ttgtgccagt gtggacaact gccgattatt 42 0 

tgttgggggc atcccaaaaa ccaaaaagag agaagaaatc ttatcggaga tgaaaaaggt 4 80 

tactgaaggt gttgtcgatg tcatcgtcta cccaagcgct gcagataaaa ccaaaaaccg 54 0 

aggctttgcc ttcgtggagt atgagagtca tcgagcagct gccatggcga ggaggaaact 6 00 

gctaccagga agaattcagt tatggggaca tggtattgca gtagactggg cagagccaga 660 

agtagaagtt gatgaagata caatgtcttc agtgaaaatc ctatatgtaa gaaatcttat 72 0 

gctgtctacc tctgaagaga tgattgaaaa ggaattcaac aatatcaaac caggtgctgt 780 

ggagagggtg aagaaaattc gagactatgc ttttgtgcac ttcagtaacc gagaagatgc 840 

agttgaggct atgaaagctt taaatggcaa ggtgctggat ggttccccca ttgaagtcac 900 

cctagcaaaa ccagtggaca aggacagtta tgttaggtat acccgaggca caggtggaag 96 0 

gggcaccatg ctgcaaggag agtataccta ctctttgggc caagtttatg atcccaccac 1020 

aacctacctt ggagctcctg tcttctatgc cccccagacc tatgcagcaa ttcccagtct 1080 

tcatttccca gccaccaaag gacatctcag caacagagcc attatccgag ccccttctgt 1140 

tagaggggct gcgggagtga gaggactggg cggccgtggc tatttggcat acacaggcct 12 00 

gggtcgagga taccaggtca aaggagacaa aagagaagac aaactctatg acattttacc 12 60 

tgggatggag ctcaccccaa tgaatcctgt cacattaaaa ccccaaggaa ttaaactcgc 1320 

tccccagata ttagaagaga tttgtcagaa aaataactgg ggacagccag tgtaccagct 13 8 0 

gcactctgct attggacaag accaaagaca gctattcttg tacaaaataa ctattcctgc 1440 

tctagccagc cagaatcctg caatccaccc tttcacacct ccaaagctga gtgcctttgt 1500 
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ggatgaagca aagacgtatg cagccgaata caccctgcag accctgggca tccccactga 1560 

tggaggcgat ggcaccatgg ctactgctgc tgctgctgct actgctttcc caggatatgc 162 0 

tgtccctaat gcaactgcac ccgtgtctgc agcccagctc aagcaagcgg taacccttgg 1680 

acaagactta gcagcatata caacctatga ggtctaccca acttttgcag tgactgcccg 1740 

aggggatgga tatggcacct tctgaagatg 177 0 



<210> 9 
<211> 1717 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 9 

ctgccagggg gagggcccca gagaaaacca gaaagagggt gagagactga ggaagataaa 60 

gcgtcccagg gcctcctaca ccagcgcctg agcaggaagc gggaggggcc atgactacga 120 

ggccctggga ggtcacttta gggagggctg tcctaaaacc agaagcttgg agcagaaagt 18 0 

gaaaccctgg tgctccagac aaagatctta gtcgggacta gccggccaag gatgaagcct 24 0 

cacttcagaa acacagtgga gcgaatgtat cgagacacat tctcctacaa cttttataat 300 

agacccatcc tttctcgtcg gaataccgtc tggctgtgct acgaagtgaa aacaaagggt 3 60 

ccctcaaggc cccctttgga cgcaaagatc tttcgaggcc aggtgtattc cgaacttaag 420 

taccacccag agatgagatt cttccactgg ttcagcaagt ggaggaagct gcatcgtgac 480 

caggagtatg aggtcacctg gtacatatcc tggagcccct gcacaaagtg tacaagggat 54 0 

atggccacgt tcctggccga ggacccgaag gttaccctga ccatcttcgt tgcccgcctc 600 

tactacttct gggacccaga ttaccaggag gcgcttcgca gcctgtgtca gaaaagagac 660 

ggtccgcgtg ccaccatgaa gatcatgaat tatgacgaat ttcagcactg ttggagcaag 720 

ttcgtgtaca gccaaagaga gctatttgag ccttggaata atctgcctaa atattatata 780 

ttactgcaca tcatgctggg ggagattctc agacactcga tggatccacc cacattcact 84 0 

ttcaacttta acaatgaacc ttgggtcaga ggacggcatg agacttacct gtgttatgag 900 

gtggagcgca tgcacaatga cacctgggtc ctgctgaacc agcgcagggg ctttctatgc 960 

aaccaggctc cacataaaca cggtttcctt gaaggccgcc atgcagagct gtgcttcctg 102 0 

gacgtgattc ccttttggaa gctggacctg gaccaggact acagggttac ctgcttcacc 1080 

tcctggagcc cctgcttcag ctgtgcccag gaaatggcta aattcatttc aaaaaacaaa 114 0 

cacgtgagcc tgtgcatctt cactgcccgc atctatgatg atcaaggaag atgtcaggag 1200 

gggctgcgca ccctggccga ggctggggcc aaaatttcaa taatgacata cagtgaattt 1260 

aagcactgct gggacacctt tgtggaccac cagggatgtc ccttccagcc ctgggatgga 132 0 

ctagatgagc acagccaaga cctgagtggg aggctgcggg ccattctcca gaatcaggaa 1380 

aactgaagga tgggcctcag tctctaagga aggcagagac ctgggttgag cctcagaata 1440 

aaagatcttc ttccaagaaa tgcaaacagg ctgttcacca ccatctccag ctgatcacag 1500 

acaccagcaa agcaatgcac tcctgaccaa gtagattctt ttaaaaatta gagtgcatta 1560 

ctttgaatca aaaatttatt tatatttcaa gaataaagta ctaagattgt gctcaataca 1620 

cagaaaagtt tcaaacctac taatccagcg acaatttgaa tcggttttgt aggtagagga 1680 

ataaaatgaa atactaaatc tttctgtaaa aaaaaaa 1717 



<210> 10 
<211> 384 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 10 

Met Lys Pro His Phe Arg Asn Thr Val Glu Arg Met Tyr Arg Asp Thr 

15 10 15 

Phe Ser Tyr Asn Phe Tyr Asn Arg Pro lie Leu Ser Arg Arg Asn Thr 

20 25 30 

Val Trp Leu Cys Tyr Glu Val Lys Thr Lys Gly Pro Ser Arg Pro Pro 
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35 










40 


Leu 


Asp 


Ala 


Lys 


He 


Phe 


Arg 


Gly 




50 










55 




His 


Pro 


Glu 


Met 


Arg 


Phe 


Phe 


His 


65 










70 






His 


Arg 


Asp 


Gin 


Glu 


Tyr 


Glu 


Val 










85 








Cys 


Thr 


Lys 


Cys 


Thr 


Arg 


Asp 


Met 








100 










Lys 


Val 


Thr 


Leu 


Thr 


He 


Phe 


Val 






115 










120 


Pro 


Asp 


Tyr 


Gin 


Glu 


Ala 


Leu 


Arg 




13 0 










135 




Pro 


Arg 


Ala 


Thr 


Met 


Lys 


He 


Met 


145 










150 






Trp 


Ser 


Lys 


Phe 


Val 


Tyr 


Ser 


Gin 










165 








Asn 


Leu 


Pro 


Lys 


Tyr 


Tyr 


He 


Leu 








180 










Leu 


Arg 


His 


Ser 


Met 


Asp 


Pro 


Pro 






195 










200 


Glu 


Pro 


Trp 


Val 


Arg Gly Arg 


His 




210 










215 




Glu 


Arg 


Met 


His 


Asn 


Asp 


Thr 


Trp 


225 










230 






Phe 


Leu 


Cys 


Asn 


Gin 


Ala 


Pro 


His 










245 








His 


Ala 


Glu 


Leu 


Cys 


Phe 


Leu 


Asp 








260 










Leu Asp 


Gin 


Asp 


Tyr 


Arg 


Val 


Thr 






275 










280 


Phe 


Ser 


Cys 


Ala 


Gin 


Glu 


Met 


Ala 




290 










295 




Val 


Ser 


Leu 


Cys 


He 


Phe 


Thr 


Ala 


305 










310 






Cys 


Gin 


Glu 


Gly 


Leu 


Arg 


Thr 


Leu 










325 








He 


Met 


Thr 


Tyr 


Ser 


Glu 


Phe 


Lys 








340 










His 


Gin 


Gly Cys 


Pro 


Phe 


Gin 


Pro 






355 










360 


Gin 


Asp 


Leu 


Ser 


Gly Arg 


Leu 


Arg 




370 










375 





<210> 11 
<211> 8 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthetic construct 

<220> 

<221> misc_feature 
<222> 4 

<223> n can be a or u 



45 



Gin 


Val 


Tyr 


Ser 


Glu 


Leu 


Lys 


Tyr 








60 










Trp 


Phe 


Ser 


Lys 


Trp 


Arg 


Lys 


Leu 






75 










80 


Thr 


Trp 


Tyr 


He 


Ser 


Trp 


Ser 


Pro 




90 










95 




Ala 


Thr 


Phe 


Leu 


Ala 


Glu 


Asp 


Pro 


105 










110 






Ala 


Arg 


Leu 


Tyr 


Tyr 


Phe 


Trp 


Asp 










125 








Ser 


Leu 


Cys 


Gin 


Lys 


Arg 


Asp 


Gly 








140 










Asn 


Tyr 


Asp 


Glu 


Phe 


Gin 


His 


Cys 






155 










160 


Arg 


Glu 


Leu 


Phe 


Glu 


Pro 


Trp 


Asn 




170 










175 




Leu 


His 


He 


Met 


Leu 


Gly 


Glu 


He 


185 










190 






Thr 


Phe 


Thr 


Phe 


Asn 


Phe 


Asn 


Asn 










205 








Glu 


Thr 


Tyr 


Leu 


Cys 


Tyr 


Glu 


Val 








220 










Val 


Leu 


Leu 


Asn 


Gin 


Arg 


Arg 


Gly 






235 










240 


Lys 


His 


Gly 


Phe 


Leu 


Glu 


Gly 


Arg 




250 










255 




Val 


He 


Pro 


Phe 


Trp 


Lys 


Leu 


Asp 


265 










270 






Cys 


Phe 


Thr 


Ser 


Trp 


Ser 


Pro 


Cys 










285 








Lys 


Phe 


He 


Ser 


Lys 


Asn 


Lys 


His 








3 00 










Arg 


He 


Tyr 


Asp 


Asp 


Gin 


Gly 


Arg 






315 










320 


Ala 


Glu 


Ala 


Gly 


Ala 


Lys 


He 


Ser 




330 










335 




His 


Cys 


Trp 


Asp 


Thr 


Phe 


Val 


Asp 


345 










350 






Trp 


Asp 


Gly 


Leu 


Asp 


Glu 


His 


Ser 










365 








Ala 


He 


Leu 


Gin 


Asn 


Gin 


Glu 


Asn 



380 



Sequence; note = 



<220> 

<221> misc feature 



9 
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<222> 6 

<223> n can be a, g, c, or t (u) 

<400> 11 
uuununuu 

<210> 12 
<211> 18 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note 
synthetic construct 

<400> 12 

caauuugauc aguauauu 

<210> 13 
<211> 21 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note 
synthetic construct 

<220> 

<221> misc__f eature 
<222> 5 

<223> n can be a g, c, or t (u) 
<400> 13 

gauanaauuu gaucaguaua u 

<210> 14 
<211> 31 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note 
synthetic construct 

<220> 

<221> misc^feature 
<222> 10 

<223> n can be a, g, c or t (u) 
<400> 14 

ccuuuauuan gaauugugau cacauccucu g 

<210> 15 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note 
synthetic construct 



WO 2005/115410 



<220> 

<221> misc^feature 
<222> 10 

<223> n can be a, g, c or t (u) 
<400> 15 

aagcaagttn tttgatcagt ttactcaaac a 

<210> 16 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note 
synthetic construct 

<220> 

<221> misc_feature 
<222> 10 

<223> n can be a, g, c or t (u) 
<400> 16 

aaatgctgcn agaaattgat cagaataagg a 

<210> 17 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note 
synthetic construct 

<220> 

<221> misc_f eature 
<222> 10 

<223> n can be a, g, c or t (u) 
<400> 17 

ttacttgcan caaactgatc agttttgaga g 

<210> 18 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note 
synthetic construct 

<220> 

<221> mi sc_f eature 
<222> 10 

<223> n can be a, g, c or t (u) 
<400> 18 

aggtacgttn tggatgatca gtacacaagt tc 

<210> 19 
<211> 4 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 19 
Trp Arg Cys Tyr 
1 

<210> 20 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<220> 

<221> VARIANT 

<222> 1, 2, 7, 8 

<22 3> Xaa = any amino acid 

<400> 20 

Xaa Xaa Trp Arg Glx Tyr Xaa Xaa 
1 5 

<210> 21 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<220> 

<221> VARIANT 

<222> 5, 8, 14 

<223> Xaa = any amino acid 

<400> 21 

Arg Gly Thr Trp Xaa Trp Arg Xaa Tyr Arg Gly Thr Trp Xaa 
15 10 

<210> 22 
<211> 16 
<212> DNA 

<213> Artificial Sequence' 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<220> 

<221> misc_feature 
<222> 1, 8 

<223> n can be a, g, c, or t (u) 
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<400> 22 

nagctagnta agttat 16 
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